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INTRODUCTION 



1.1 TRELLIS REPRESENTATION OF CODES 

A code trellis is a graphical representation of a code, block or convolutional, in 
which every path represents a codeword (or a code sequence for a convolutional 
code). This representation makes it possible to implement maximum likelihood 
decoding (MLD) of a code with reduced decoding complexity. The most well 
known trellis-based MLD algorithm is the Viterbi algorithm [23, 79, 105]. The 
trellis representation was first introduced and used for convolutional codes [23]. 
This representation, together with the Viterbi decoding algorithm, has resulted 
in a wide range of applications of convolutional codes for error control in digital 
communications over the last two decades. 

The recent search for efficient MLD schemes for linear block codes has moti- 
vated some coding theorists to study the trellis structure of these codes so that 
trellis-based decoding algorithms can be devised to reduce decoding complexity. 
Trellis representation of linear block codes was first presented in [1] and then 
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2 TRELLISES AND TRELLIS-BASED DECODING ALGORITHMS 

in [67, 109]. The first serious study of trellis structure and trellis construction 
for linear block codes was due to Wolf. In his 1978 paper [109], Wolf presented 
the first method for constructing trellises for linear block codes and proved that 
an JV-section trellis diagram for a g-ary (JV, K) linear block code has at most 
g mm {K y N-K) states. He also presented a method for labeling the states based on 
the parity-check matrix of a code. Right after Wolf’s work, Massey presented 
a simple but elegant paper [67] in which he gave a precise definition of a code 
trellis, derived some fundamental properties, and provided implications of the 
trellis structure for encoding and decoding of codes. However, these early works 
in trellis representation of linear block codes did not arouse much enthusiasm, 
and for the next 10 years, there was basically no research in this area. 

There are two major reasons for this inactive period of research in this area. 
First, most coding theorists at that time believed that block codes did not have 
simple trellis structure like convolutional codes and maximum likelihood decod- 
ing of linear block codes using the Viterbi algorithm was practically impossible, 
except for very short block codes. Second, since almost all of the linear block 
codes are constructed algebraically or based on finite geometries, it was the 
belief of many coding theorists that algebraic decoding was the only way to 
decode these codes. These two reasons seriously hindered the development of 
efficient soft-decision decoding methods for linear block codes and their appli- 
cations to error control in digital communications. This led to a general belief 
that block codes are inferior to convolutional codes and hence, that they were 
not useful. 

In fact, for more than two decades, most of the practicing communication 
engineers believed that the rate- 1/2 convolutional code of constraint length 7 
with Viterbi decoding was the only effective error control coding scheme for 
digital communications, except for perhaps ARQ schemes. To achieve higher 
reliability for certain applications such as NASA’s satellite and deep space 
communications, this convolutional code concatenated with a Reed-Solomon 
outer code was thought the best solution. 

It was really Forney’s paper in 1988 [24] that aroused enthusiasm for research 
in the trellis structure of linear block codes. In this paper, Forney showed that 
some block codes, such as Reed-Muller (RM) codes and some lattice codes, 
do have relatively simple trellis structures, and he presented a method for con- 
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INTRODUCTION 3 


structing sectionalized trellises for linear block codes and asserted that the con- 
struction results in minimal trellises with respect to the state complexity (the 
number of states). Motivated by Forney’s work and the desire to achieve max- 
imum likelihood decoding of linear block codes to improve error performance 
over traditional hard-decision algebraic decoding, there have been significant 
efforts in studying the trellis structure and devising trellis-based decoding algo- 
rithms for linear block codes over the last eight years. Developments have been 
dramatic and rapid, and the new results are exciting and encouraging. Trellis- 
based decoding algorithms that are more efficient than the conventional Viterbi 
decoding algorithm have recently been devised [32, 37] and implementation of 
trellis-based high-speed decoders for NASA’s high-speed satellite communica- 
tions is now underway [52, 63]. All of these new developments make block codes 
more competitive with convolutional codes. 


1.2 ORGANIZATION OF THE BOOK 

Chapter 2 gives a brief review of linear block codes. The goal is to provide 
the essential background material for the development of trellis structure and 
trellis-based decoding algorithms for linear block codes in the later chapters. 
Chapters 3 through 6 present the fundamental concepts, finite-state machine 
model, state space formulation, basic structural properties, state labeling, con- 
struction procedures, complexity, minimality, and sectionalization of trellises. 
Chapter 7 discusses trellis decomposition and subtrellises for low-weight code- 
words. Chapter 8 first presents well known methods for constructing long 
powerful codes from short component codes or component codes of smaller 
dimensions, and then provides methods for constructing their trellises which 
include Shannon and Cartesian product techniques. Chapter 9 deals with con- 
volutional codes, puncturing, zero-tail termination and tail-biting. It shows 
that trellis construction procedures for both block and convolutional codes are 
essentially the same, except that the trellises for convolutional codes or termi- 
nated convolutional codes are time- invariant and the trellises for block codes 
are in general time-varying. For both types of codes, trellis states are defined 
based on a certain set of information bits, called the state-defining information 
set. 
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4 TRELLISES AND TRELLIS-BASED DECODING ALGORITHMS 


Chapters 10 through 13 present various trellis-based decoding algorithms, 
old and new. Chapter 10 first discusses the application of the well known 
Viterbi decoding algorithm to linear block codes, optimum sectionalization of 
a code trellis to minimize computation complexity, and design issues for IC 
(integrated circuit) implementation of a Viterbi decoder. Then it presents a 
new decoding algorithm for convolutional codes, named differential trellis de- 
coding (DTD) algorithm. DTD algorithm is devised based on the principle of 
compare-select-add (CSA) which is simply the opposite of the principle of 
add-compare-select (ACS) used in the Viterbi algorithm. This new algo- 
rithm is more efficient than the Viterbi decoding algorithm. For rate- 1/2 an- 
tipodal convolutional codes and their higher rate punctured codes, it requires 
about 1/3 less real number operations than the Viterbi decoding algorithm. 
This DTD algorithm can also be applied to trellis decoding of block codes. 
Chapter 11 presents a trellis-based recursive MLD for linear block codes, the 
RMLD algorithm. This decoding algorithm is devised based on the divide 
and conquer principle. The implementation of this algorithm does not re- 
quire the construction of the entire code trellis; only some special one-section 
trellises of much smaller state and branch complexities for constructing path 
metric tables recursively are needed. This reduces the decoding complexity 
significantly and it is more efficient than the Viterbi decoding algorithm. Fur- 
thermore, it allows parallel/pipeline processing of received sequences to speed 
up decoding. Chapter 12 presents a suboptimum reliability- based iterative de- 
coding algorithm with a low-weight trellis search for the most likely codeword. 
This decoding algorithm provides a good trade-off between error performance 
and decoding complexity. All the decoding algorithms presented in Chapters 10 
through 12 are devised to minimize word error probability. Chapter 13 presents 
decoding algorithms that minimize bit error probability and provide the corre- 
sponding soft (reliability) information at the output of the decoder. Decoding 
algorithms presented are the MAP (maximum a posteriori probability) 
decoding algorithm and the SOVA (soft-output Viterbi algorithm) algo- 
rithm. Finally, the minimization of bit error probability in trellis-based MLD 
is discussed. 
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LINEAR BLOCK CODES 


Chapter 2 gives a brief review of linear block codes. The goal is to provide 
the essential background material for the development of trellis structure and 
trellis-based decoding algorithms for linear block codes in the later chapters. 
We mainly present the basic concepts of encoding and decoding of linear block 
codes and state some facts without derivations or proofs. Since in most present 
digital data communication systems, information is coded in binary digits, ‘O’ 
or 4 1\ we discuss only linear block codes with symbols from the binary field 
GF(2). First, linear block codes are defined and described in terms of generator 
and parity-check matrices. Second, coset partition of a linear block code is dis- 
cussed, which is needed in analyzing the code trellis structure and construction. 
Third, the concepts of minimum distance, weight distribution and distance pro- 
file are presented, which are needed in the later chapters for presenting decoding 
algorithms and their error performances. Finally, the concepts of hard-decision, 
soft-decision, and maximum likelihood decoding are presented. 
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6 TRELLISES AND TRELLIS-BASED DECODING ALGORITHMS 


References 3, 9, 14, 59, 63, 78 and 79 contain excellent treatments of linear 
block codes. 

2.1 GENERATION OF LINEAR BLOCK CODES 

In block coding, an information sequence of binary digits (called bits) is divided 
into message blocks of fixed length; each message block consists of K informa- 
tion bits. There are a total of 2 K distinct messages. Each message is encoded 
into a codeword (or code sequence) of N bits according to certain rules, where 
N > K. Therefore, corresponding to the 2 K possible messages, there are 2 K 
codewords. This set of 2 K codewords forms a block code of length N. For 
a block code to be useful, the 2 K codewords must be distinct. Hence, there 
should be a one-to-one correspondence between a message and a codeword. 

Definition 2.1 A binary block code of length N and 2 K codewords is called an 
( jV, K) linear block code if and only if its 2 K codewords form a if- dimension a] 
subspace of the vector space of all the iV-tuples over the binary field GF(2). 
The parameter K is called the dimension of the code space. 

An (N y K) linear block code C is generated by a if x iV generator matrix 
over GF(2), 


’ 9 1 


9n 

9\2 

9u * 

* 9lN 

9 2 

=z 

<?21 

922 

' * 92i * 

92N 

. 9 k . 


9KI 

9K2 • 

* * 9Ki * 

9kn 


where the rows, g x , g 2 i - • M 9k' are linearly independent over GF(2). The 2 K 
linear combinations of the K rows of G form the codewords of C. We say that 
the rows of G span the code C, or C is the row space of G. Let 

a = (ai,a 2 ,... ,ajf) 

be a message to be encoded. A natural encoding mapping is that the codeword 

U = (u 1 ,U 2 ,...,Utv) 

for the message a — (ax , a 2l . . . , ax) is given by 
u — a ■ G 
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LINEAR BLOCK CODES 7 


= (<*1,02, . . . ,Ojf) 


9l 
9 2 


L 9 k J 

= oi - 5i+02 - 52H +< 1 K'9k- 


( 2 . 2 ) 


From (2.1) and (2.2), we find that for 1 < i < AT, the i-th component of u is 
given by 

Ui = ai * gu + ■ 52^ H l-ojr ( 2 * 3 ) 


During an encoding interval, K information bits are encoded into N code 
bits. These N code bits are shifted onto the channel, one at a time, in N units 
of time. An encoding interval, denoted T, is represented by a set of /V + 1 time 
instants, 

r = {0,1,2 (2.4) 

For 1 < i < the i-th unit of time is the interval from tim e-(i — 1) to time-i. 
During this interval, the i-th code bit u* is formed and transmitted. By time-i, 
the transmission is completed. This interval is called a bit interval. 

Example 2.1 Consider a binary (8, 4) linear block code which is generated by 
the following generator matrix: 
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If a = (HOI) is the message to be encoded, its corresponding codeword, ac- 
cording to (2.2), is given by 


u = 1 * 9i + 1 92 + 0 * 93 + 1 * 9\ 

= ( 11111111 ) + ( 00001111 )+ ( 01010101 ) 
= ( 10100101 ). 

The 16 codewords of this code are listed in Table 2.1. 


AA 
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8 TRELLISES AND TRELLIS-BASED DECODING ALGORITHMS 


Table 2.1. The codewords of the code generated by (2.5). 


Messages Codewords Messages Codewords 


( 0000 ) ( 00000000 ) 
( 1000 ) ( 11111111 ) 
( 0100 ) ( 00001111 ) 
( 1100 ) ( 11110000 ) 
( 0010 ) ( 00110011 ) 
( 1010 ) ( 11001100 ) 
( 0110 ) ( 00111100 ) 
( 1110 ) ( 11000011 ) 


( 0001 ) ( 01010101 ) 
( 1001 ) ( 10101010 ) 
( 0101 ) ( 01011010 ) 
( 1101 ) ( 10100101 ) 
( 0011 ) ( 01100110 ) 
( 1011 ) ( 10011001 ) 
( 0111 ) ( 01101001 ) 
(ini) (looiono) 


A binary ( N , K) linear block code C is also uniquely specified by an ( N — 
K) x N matrix over GF(2), called a parity-check matrix, 


H = 


fin 

fi2i 


fil2 

/l22 


filN 

h-2N 


fijv-x\l fiN-K, 2 


fiw-ir.jv 


( 2 . 6 ) 


where the rows are linearly independent. A binary TV-tuple u = (vi, U 2 * . . . , us) 
is a codeword in C if and only if the following condition holds: 


u • H t = 0, 


(2.7) 


where 0 denotes the all-zero (TV — K)- tuple, (0,0,..., 0). Code C is called 
the dual (or null) space of H. H itself generates an (TV, TV — K) linear code, 
denote C x . For any codeword u — (u lf U 2 , . . . € C and any codeword 

v — (i>i , t; 2 , . . . , vs ) € C the inner product 

u - v = Ui • V\ + u 2 • v 2 H + un ' vn 

= 0 . 


C 1 - is called the dual code of C and vise versa. 
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LINEAR BLOCK CODES 9 


In general, the generator matrix G of a linear block code C is used for 
encoding, while the parity-check matrix H is used for decoding, particularly 
for error detection. 


Example 2.2 Consider the (8,4) linear block code given in Example 2.1. A 
parity-check matrix for this code is the generator matrix itself given by (2.5), 
i.e., 

“ 11111111 “ 

H = G = 00001111 
0 0 1 1 0 0 1 1 
. 01010101 , 

In this case, C = C L and C is said to be self-dual. 


AA 


For an (N> K) linear block code G, the ratio R = K/N is called the code 
rate which represents the average number of information bits carried by a code 
symbol (or the average number of information bits transmitted per channel 
usage) . 


2.2 COSET PARTITION OF A LINEAR BLOCK CODE 

Consider a binary (N y K) linear block code C with a generator matrix G. Let 
K\ be a nonnegative integer such that 0 < K\ < K. A subset of 2 Kl codewords 
in C is said to be a linear subcode of C if this subset itself is a K\ -dimensional 
subspace of the vector space of all the iV-tuples over GF(2). Any K\ rows of 
the generator matrix G span an (iV, K\) linear subcode of G, and they form a 
generator matrix for the subcode. If K\ = 0, the subcode consists of only the 
all-zero codeword 0 of G. For K\ = AT, the subcode is just the code itself. 

Let Ci be an ( N , K i) linear subcode of G. Then G can be partitioned into 
2 K ~ Kx disjoint cosets of C\\ each coset is of the following form: 

Ui0Gi = {t>i + u : u e Ci) (2.8) 

with 1 < / < 2 k ~ k \ where for Vi ^ 0, t>i is in G but not in Gi and for vi = 0, 
the coset 0 0 C\ is just the subcode C\ itself. This partition of G with respect 
to C\ is denoted with G/Gi, and the codewords Vi for 1 < / < 2 K ~ Kx are called 
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10 TRELLISES AND TRELLIS-BASED DECODING ALGORITHMS 


the coset representatives. Any codeword in a coset can be used as the coset 
representative without changing the composition (the codewords) of the coset. 
Important properties of cosets are: 

(1) The sum of two codewords in a coset is a codeword in the subcode C\. 

(2) Let x and y be two codewords in cosets Ui©Ci and Vj ©Cj , respectively, 
where i ^ j. Then the sum x+y is a codeword in the coset (vi + v ; )0Ci 
with Vi + Vj as the coset representative. 

The set of representatives for the cosets in the partition C/C\ is denoted [ C/C\ ] 
which is called the coset representative space for the partition CjC\. Code 
C can be expressed as the direct-sum of C\ and [C/Ci] as follows: 

C = [C/Ci\ © Ci = {v + u : v e [CjCi] and 1 1 € C\}. (2.9) 

Let G\ be the subset of K\ rows of the generator matrix G which generates 
the subcode C\. Then the 2 K ~ Kl codewords generated by the K — K x rows in 
the set G\Gi can be used as the representatives for the cosets in the partition 
C/C \ . These 2 K ~ Kx codewords form an (N, K — K\) linear subcode of C. 

Let C 2 be an linear subcode of Ci with 0 < K 2 < K\. We can 

further partition each coset Vi 0 C\ in the partition C/C\ based on C 2 into 
2^1 -k 3 cose t s of C 2 ; each coset consists of the following codewords in C: 

Vi © ( w k 0 C 2 ) = + Wk + u : u £ C 2 } (2.10) 

with 1 < / < 2 K ~ Kl and 1 < k < 2 Kx ~ K * where for Wk ^ 0, Wk is a codeword 
in C\ but not in C 2 . We denote this partition with C/Ci/C 2 . This peirtition 
consists of 2 k ~ k * cosets of C 2 . Now C can be expressed as the following 
direct-sum: 

C — [C/Ci] 0 \Ci/C 2 \ 0 C 2 . (2.11) 

Let Ci,C 2j ... ,C m be a sequence of linear subcodes of C with dimensions 
K\, K 2y . . . , Km, respectively, such that 

CDC x DC 2 D-DC m (2.12) 

and 

K > Ki > K 2 > • • • > K m > 0. (2.13) 
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LINEAR BLOCK CODES 11 


Then we can form a chain of partitions, 

C/C u C/Cj/Ca, . . . , C/Ci/Ca/ • • • /C m , (2.14) 

and C can be expressed as the following direct-sum: 

C = [C/Ci] e [Ci/Ct] © • • • © [C m _!/C m ) © C m . (2.15) 


2.3 THE MINIMUM DISTANCE AND WEIGHT DISTRIBUTION OF A 
LINEAR BLOCK CODE 

Let u and v be two fV-tuples over GF(2). The Hamming distance between 
u and u, denoted d(u,v), is defined as the number of places where they differ. 
The minimum (Hamming) distance of a block code C, denoted d, n j n (C), 
is defined as the minimum Hamming distance between all distinct pairs of 
codewords in C, i.e., 

d min (C) = min{d(tt, v) : u y v £ C,u ^ v}. (2.16) 

The (Hamming) weight of an N- tuple v, denoted u?(v), is defined as the 
number of nonzero components of v. It follows from the definition of Hamming 
distance and the fact that the sum of two 7V-tuples over GF(2) is an another 
TV-tuple over GF(2) that the Hamming distance between two TV-tuples, u and 
v, is equal to the Hamming weight of the sum of u and v, i.e., 

d(u, v) = u>(u + v). (2.17) 

For a linear block code C, it follows from (2.16) and (2.17) that 

riming) = min{ty(u + v) : u,v € C y u ^ v} 

= min{u;(x) : x € C, x ^ 0} 

± Wmin(C). ( 2 . 18 ) 

The parameter w min (C) = min{u;(x) : x € C,x ^ 0} is called the minimum 
weight of C. Eq.(2.18) simply says that the minimum distance of a linear block 
code is equal to the minimum weight of its nonzero codewords. The minimum 
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weight of the (8,4) linear block code given in Table 2.1 is 4; therefore, its 
minimum distance is 4. 

Let C be an (N, K) linear block code. For 0 < i < N, let A y be the number 
of codewords with weight i. The numbers Ao,Ai,A2, . . . , As are called the 
weight distribution of C . It is clear that Ao = 1. The weight distribution of 
the (8,4) linear block code given in Table 2.1 is 

Aq = 1, A i = A2 — A3 = 0, A 4 — 14, A5 = Aq — Ai — 0, A% — 1. 

The weight distribution is often expressed as a polynomial, 

A(X) = Ao + A r X + • • • + A N X N , 

which is called the weight enumerator of C. Let W = {0, icq, u? 2 > • • ■ ,u> m } 
denote the set of all weights of codewords in C such that: 

(i) 0 < wi < w 2 < * • • < Wm < N\ and 

(ii) For 1 < : < m, the number of codewords in C with weight is not 
equal to zero. 

This set is called the weight profile of C. The weight profile of the (8,4) 
linear block code given by Table 2.1 is {0,4,8}. Let u be any codeword in 
C. The weight distribution of C actually gives the distribution of distances of 
codewords in C from the codeword u. The weight profile W of C gives the 
profile of distances of codewords in C from the codeword u . 

The error performance of a linear block code is determined by its minimum 
distance and weight distribution. For an ( N t K) linear block code with min- 
imum distance d m - m , we often use the notation (iV, K> d m i n ) to represent the 
code. Therefore, the code given by Table 2.1 is an (8,4,4) linear block code. 

2.4 DECODING 

Suppose an (AT, K) linear block code C is used for error control over an ad- 
ditive white Gaussian noise (AWGN) channel. Let u = (tii,U2, . . . ,u^) be 
the codeword to be transmitted. Before the transmission, a modulator maps 
each code bit into an elementary signal waveform. Binary PSK or FSK are 
commonly used signal waveforms for transmitting the bits in a codeword. The 


DRAFT 


January 6, 1998, 8:40pm 


DRAFT 


LINEAR BLOCK CODES 13 


resultant signal sequence is then transmitted over the channel and corrupted 
by noise. At the receiving end, the received signal sequence is processed by a 
demodulator and sampled at the end of each signal (bit) interval. This results 
in sequence of real numbers, 

r = (ri,r 2 ,...,rAr), 

which is called the received sequence. For 1 < i < AT, the i-th received 
component is the sum of a fixed real number C{ and a Gaussian random 
variable of zero-mean and variance No/2 where c* corresponds to the trans- 
mitted code bit u; at time-i. These received components may or may not be 
quantized. At one extreme, the demodulator can be used to make firm deci- 
sions on whether each transmitted code bit is a or a ‘1\ Thus the output 
is quantized to two levels, denoted as 0 and 1. We say that the demodulator 
has made a “hard decision” on each transmitted code bit. This hard decision 
results in a binary received sequence, 

which may contain transmission errors, i.e., for some i, Z{ ^ U{. This bi- 
nary hard-decision sequence is fed into a decoder which attempts to correct the 
transmission errors (if any) and recover the transmitted codeword u. Since the 
decoder operates on the hard decisions made by the demodulator, the decoding 
process is called hard-decision decoding. At the other extreme, the unquan- 
tized outputs from the demodulator can be fed directly into the decoder for pro- 
cessing. We refer to the resulting decoding as soft-decision decoding. Since 
the decoder makes use of the additional information contained in the unquan- 
tized received samples to recover the transmitted codeword, soft-decision decod- 
ing provides better error performance than hard-decision decoding. Decoding 
based on the quantized outputs from the demodulator, where the number of 
quantization levels exceeds two, is also referred to as soft-decision decoding. 
Soft-decision decoding provides better error performance than hard-decision 
decoding; however, hard-decision decoding is much simpler to implement. Var- 
ious hard-decision decoding algorithms based on the algebraic structures of 
linear block codes have been devised. These hard-decision decoding algorithms 
are also termed algebraic decoding algorithms. Recently, effective soft-decision 
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decoding algorithms have been devised, and they achieve either optimum error 
performance or suboptimum error performance with reduced decoding com- 
plexity. 

Let u m be the estimate of the transmitted codeword at the output of the 
decoder. If the codeword u was transmitted, a decoding error occurs if and 
only if u m ^ u. Given that r is received, the conditional error probability of 
the decoder is defined as 


P(E | r) ^ P{u m u | r). 

(2.19) 

The error probability of the decoder is then given by 

P(E) = 52P(E|r)P(r). 

(2.20) 


P(r) is independent of the decoding rule used since r is produced prior to 
decoding. Hence, an optimum decoding rule (i.e., one that minimizes P(E)) 
must minimize P(E | r) = P(u * ^ u \ r) for all r. Since minimizing P(u* ^ 
u | r) is equivalent to maximizing P(u m = u | r), P(E | r) is minimized for a 
given r by choosing u* as the codeword that maximizes 


P(u | r) = 


P(r | u)-P(tt) 
P(r) 


( 2 . 21 ) 


that is, u m is chosen as the most likely codeword given r is received. If all 
the codewords are equally likely, maximizing (2.21) is equivalent to maximizing 
P(r | u). For an AWGN channel, 


N 

P(r I «) = n p ( r < I “<)« ( 2 - 22 ) 

*=1 


since each received symbol depends on the corresponding transmitted symbol. 
A decoder that chooses its estimate to maximize (2.22) is called a maximum 
likelihood decoder and the decoding process is called the maximum like- 
lihood decoding (MLD). Maximizing (2.22) is equivalent to maximizing 


N 

log P(r I tt) = 52 lo 8 P ( r ' I u *) ( 2 - 23 ) 

«=i 

which is called the log-likelihood function. 
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Suppose BPSK signaling is used. Assume that each signal has unit energy. 
Let u = (ttijiij,... be the codeword to be transmitted. The modulator 
maps this codeword into a bipolar sequence represented by 

c = (ci,c 2 ,... ,c*r) 



where for 1 < i < N, 

C{ = 2u{ — 1. 

(2.24) 

— 

From (2.24), we see that 



— 

Ci = 

f - 1 , if Ui = 0, 
\ +1, if Uj — 1. 

(2.25) 


The squared Euclidean distance between the received sequence r = (r^, r 2 , 
. . rtf) and c is given by 


|r-c| 2 ^f>- Ci ) 2 - (2-26) 

1=1 

For an AWGN channel, maximizing the log- likelihood function is equivalent to 
minimizing the squared Euclidean distance between r and c. If we expand the 
right-hand side of (2.26), we have 

i r - c i 2 = ]C r i - 2 H r * -ci+ ]C c ?- ( 2 - 27 ) 

1=1 t=i i=i 

In computing |r — c| 2 for all codewords in C, r? is a common term and 

cj = N. Therefore, minimizing |r — c| 2 of (2.26) is equivalent to maxi- 
mizing 

N 

r c = ^ r { ■ Cj 

i=i 

N 

= 2u,-l). (2.28) 

«=i 

The inner product given by (2.28) is called the correlation between the re- 
ceived sequence r and the codeword u. 
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Furthermore, (2.28) can be expanded as follows: 

jv n 

r • c = 2^Tri -Ui - (2.29) 

1=1 1=1 

Since the second term in (2.29) is a common term in computing r • c 

for all codewords in C, maximizing r • c is equivalent to maximizing 

N 

r-u = ^r*-ui. (2.30) 

i=i 

The inner product given by (2.30) is called the binary correlation between 
the received sequence r and the codeword t*. 

Summarizing the above, MLD can be stated in four equivalent ways: 

(1) Log- likelihood function 

Decode the received sequence r into a codeword u for which the log- 
likelihood function 

N 

log P(r I u) = log />(»•* I u.) 

1=1 

is maximized. 

(2) Squared Euclidean distance 

Decode the received sequence r into a codeword u for which the squared 
Euclidean distance 

|r — u| 2 = — (2uj — l)) 2 

i— 1 

is minimized, 

(3) Correlation function 

Decode the received sequence r into a codeword u for which the corre- 
lation function 

N 

m(r, u) ± ^ rj • (2 u* - 1 ) 

»=i 

is maximized. 
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(4) 


Binary correlation function 

Decode the received sequence r into a codeword u for which the binary 
correlation function 

N 

b ( r » u ) = X^ r ‘ u * 


is maximized. 


2.5 REED-MULLER CODES 

Reed-Muller (RM) codes form a class of multiple error-correction codes. These 
codes were discovered by Muller in 1954 [78], but the first decoding algorithm 
for these codes was devised by Reed, also in 1954 [83]. They are finite geometry 
codes and rich in algebraic and geometric structures. The purpose of including 
these codes in this reviewing chapter is that they have very simple and regular 
trellis structures and their trellises can be easily constructed. These codes can 
be decoded effectively with trellis-based decoding algorithms. Furthermore, 
they provide good example codes. Throughout this book, many example codes 
are RM codes. 

For any nonnegative integers m and r with 0 < r < m, there exists a binary 
r-th order RM code, denoted RM rifn , with the following parameters: 

Length N rtTn — 2 m 

Dimension /T r ,m = 1 + (7) *+ + (7) 

Minimum distance d r>m = 2 m_r . 

In the following, we first present the original construction of RM codes and 
then we describe an alternate construction for these codes. For 1 < i < m , let 
v x be a 2 m -tuple over GF(2) of the following form: 

v, = (2.31) 

2'“ 1 2'” 1 2 ‘-* 2 i “ 1 

which consists of 2 m ~* +1 alternate all-zero and all-one 2 l_1 -t U ples. For m = 3, 
we have the following three 8-tuples: 

v 3 = (00001111), 

t> 2 = (00110011), 

tn = (01010101). 
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Let a = (ai, d2» . < . ,a/y) and b = (bj, 62, * « • >&n) be two binary iV-tuples. 
Define the following logic (boolean) product of a and 6, 


a « 6 = (a x • 61, a 2 • 6 2 , . . . • 6*), 


where *•' denotes the logic product (or AND operation), i.e. a, • 6^ = 1 if and 
only if both and bi are ‘1\ For m = 3 , 

v 3 • vi = (00000101). 


For simplicity, we use ab for a • fr. 

Let 1 denote the all-one 2 m -tuple, 1 = (1, 1, . . . , 1). For 1 < ii < < * < 

ii < m, the product 


is said to have degree l. Since the weights of tii, t>2» • • . , t/ m are even and powers 
of 2, it can be shown that the weight of the product is also even 

and a power of 2, in fact 2 m ~ L . 

The r-th order RM code, RM rjm , of length 2 m is generated by the following 
set of vectors: 

Grm(»%™) = {1, Vl,V2,. . . - • , 

... up to products of degree r}. ( 2 . 32 ) 


There are 



vectors in Grm(t% rn) and they are linearly independent. If the vectors in 
C ? RM( r ) m ) are arranged as rows of a matrix, then the matrix is a generator 
matrix of the RM code, RM r , m * For 0 < / < r, there are exactly (™) rows 
in Grm(^,^) of weight 2 m ~*. All the codewords of the RM code, RM r>m with 
0 < r < m, have even weights. It is also clear that the (r - l)-th order RM 
code, RM r _i >m , is a proper subcode of the r-th order RM code, RM r?m . 
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Example 2.3 Let m — 4. The 2nd order RM code of length 16 is generated 
by the following 1 1 vectors: 


Vq = 1 

1111111111111111 

t>4 

0000000011111111 

V3 

0000111100001111 

V2 

0011001100110011 

Vl 

0101010101010101 

v z v 4 

0000000000001111 

V 2 v 4 

0000000000110011 

ViV A 

0000000001010101 

V2V3 

0000001100000011 

V 1 V 3 

0000010100000101 

v x v 2 

0001000100010001 


This is a (16, 11) code with minimum distance 4. 


AA 


The code given in Example 2.1 is the 1st order RM code, RMi t 3 , of length 8. 
Let 

< 3 ( 2 , 2 ) = 

be a 2 x 2 matrix over GF(2). The two-fold Kronecker product of <3(2,2) is 
defined as 


(2.33) 


G{2 3 ,2 2 ) 


A 


1 1 
0 1 


® 


1 1 
0 1 


"1111 
0 10 1 
0 0 11 
0 0 0 1 


(2.34) 


where ® denotes the Kronecker product. The 3-fold Kronecker product of 
<3(2,2) ls defined as 

< 3 ( 2 3 , 2 3 ) — 


1 1 
0 1 


® 


1 1 
0 1 




1 1 
0 1 
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11111111 
0 10 10 10 1 
0 0 1 1 0 0 1 1 

0 0 0 1 0 0 0 1 , 

0 0 0 0 1 1 1 1 ( 2 ‘ 35 ^ 

0 0 0 0 0 1 0 1 

0 0 0 0 0 0 1 1 

00000001 

Similarly, we can define the m-fold Kronecker product of G( 2l2 )* Let N = 2 m . 
We use to denote the m-fold Kronecker product of £( 2 , 2 )- ** a 

2 m x 2 m matrix over GF(2). The rows of G (u,N) have weights, 2°, 2 1 , 2 2 , . . . , 2 m , 
and the number of rows with weight 2 m “* is (™) for 0 < / < m. 

The RM codes of length N = 2 m can be expressed in terms of the rows of 
G(s t N)- Let Grm ( r ,m) denote the matrix formed by the rows of ,/y) with 
weights equal to or greater than 2 m_r . Then Grm^to) is a generator matrix 
for the r-th order RM code, RM riTn , of length N = 2 m [24]. Actually, Grm (r, m) 
is the same set of rows as that given in (2.32). In the above construction of 
RM codes, we can also set the base matrix G< 2 , 2 ) °f (2.33) as 

1 0 ' 

1 1 

For m = 3 and r = 1, the rows of weights 4 and 8 in G^ y 2 s ) (2.35) form 

the generator matrix of the 1st order (8,4) RM code given in Example 2.1. 

Let u = (ui,ii 2 , . . . , ns) and v = (ui,u 2 , . . . , u#) be two N-tuples over 
GF(2). From u and v, we form the following binary 27V-tuple: 

|u|u + t)| = ,U N ,Ui +!>!,... ,Uj v + t>w). (2.36) 

Let C\ and C 2 be an (N,Ki t d\) and an {N, K 2 , d 2 ) binary linear codes, re- 
spectively. Assume that d 2 > d\. Form the following code: 

ICjICi -I- Ci | = {|«|ti + u| : u € Ci and t> € C 2 }. (2.37) 

Then |Ci|Ci + C 2 | is an (2N,Ki + K 2 ,d) binary linear code with 



d = min{2d 1 ,d2}. 


(2.38) 
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The above construction of a code from two component codes is called the 
|u|u + v (-construction [66] which is a powerful technique for constructing pow- 
erful long codes from short codes. 

RM codes of length 2 m can be constructed from RM codes of length 2 m " 1 
using the |u|ti -f v \ -construct ion [66]. For m > 2, the r-th order RM code 
RM r , m in |u|tx + t; (-construction is given below: 


RM r , m = {|ti|u + v| : u € RM r)m _! and v € RM r _ lim _i} (2.39) 


with generator matrix 




G rm (r,m - 1) G RM (r,m - 1) 

0 <2rm(** - l,m - 1) 


(2.40) 


where 0 is a zero matrix. Equation (2.40) shows that a RM code can be 
constructed from short RM codes by a sequence of |u|u + ^-constructions. 

Consider a boolean function f(x i,x 2 , . . . ,x m ) of m variables, Xi,x 2 , • • • 
which take values 0 or 1. For each combination of values of x u x 2t . . . , and x m , 
the function / takes a truth value either 0 or 1. For the 2 m combinations of 
values of xi,x 2 , . . . ,x m , the truth values of / form a 2 m -tuple over GF(2). 

For a nonnegative integer / less than 2 m , let (bn , &j 2 > * • • i fym) be the standard 
binary representation of /, such that l = bn H- 6 j 2 2 -h 6 ( 32 2 + • * * +b^ m 2 m 1 . For a 
given boolean function /(xi,x 2 ,. . . ,x m ), we form the following 2 m -tuple (t/uth 


vector): 

V - (vi,v 2 ,.. . ,Vl+ 1 , 

• • « , U 2 ”») 

(2.41) 

where 

uj+i — 

• i ^fm) 

(2.42) 


and (bn>bi 2 , . . . ,6j m ) is the standard binary representation of the index integer 
l. We say that the boolean function /(xi,x 2 ,... ,x m ) represents the vector v. 
We use the notation b(f) for the vector represented by /(xi,x 2 » • • • i Xjji ). 

For 1 < i < m, consider the boolean function 


f(x 1» x 2 , . . . , x m ) — Xj. 


(2.43) 


It is easy to see that this boolean function represent the vector Vi defined by 
(2.31). For 1 < M < m, the function 

/(xi,x 2 ,... ,x m ) = XiXj (2.44) 
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represents the logic product of and Vj y represented by g(x ux 2 , • ♦ ^m) = x» 

and h(x u x 2 ,x m ) = x ; , respectively. For 1 < t*i < i 2 < ■ ■ • < i r < the 
boolean function 

/(xi,x 2> . .. ,*rn) = x^Xit- '-Xi, (2.45) 

represents the logic product of . , , and v» r . Therefore, the generator 

vectors of the r-th order RM code of length N = 2 m are represented by the 
boolean functions in the following set 

i?(r, to) — X 2 , . . . t ®m» X t X 2 y X 1 X 3 , • • • » ®m— l^mj 

... up to all products of r variables}. (2.46) 

Let P(r,m) denote the set of all boolean functions (or polynomials) of degree 
r or less with m variables. Then 

RM r , m = {b(f) : / e P(r, to)}. (2.47) 


Finally, we want to point out that the dual code of the r-th order RM code, 
RM r m , is the (m — r — l)-th order RM code, RM m _ r _i, m . 
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3 TRELLIS REPRESENTATION OF 

LINEAR BLOCK CODES 


A code trellis is a graphical representation of a code, block or convolutional, in 
which every path represents a codeword (or code sequence). This representation 
makes it possible to implement maximum likelihood decoding (MLD) of a code 
with a significant reduction in decoding complexity. Chapter 3 presents the 
fundamental concepts and basic structural properties of trellises for linear block 
codes. An encoder with finite memory for a linear code is modeled as a finite- 
state machine. With this model, representation of the dynamic behavior of the 
encoder by a trellis diagram is easy to conceive. During an encoding interval, 
the state of the encoder at a specific time instant is simply defined by the 
information bits stored in the memory which affect both the past and future 
outputs of the encoder. To facilitate the construction of a code trellis, the 
generator matrix of a code is put in trellis oriented form. From this trellis 
oriented generator matrix, some basic structural properties can be derived. 
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3.1 TRELLIS REPRESENTATION OF CODES 

An encoder for a linear code C with a finite memory, for which the output 
code bits at any time instant during an encoding interval T = { 0 , 1 , 2 ,...} 
are uniquely determined by the current input information bits and the state 
of the encoder at the time can be modeled as a finite-state machine. The 
dynamic behavior of such an encoder can be graphically represented by a 
state diagram expanded in time, called a trellis diagram (or simply trellis) 
as shown in Figure 3.1. 

The encoder starts from some initial state, denoted <To. At any time instant i 
during its encoding interval T, the encoder resides in one and only one allowable 
state in a finite set. In the trellis diagram, the set of allowable states at time- 
i is represented by a set of vertices (or nodes) at the i-th level, one for each 
allowable state. The encoder moves from one allowable state at one time instant 
to another allowable state at the next time instant in one unit of time. This 
is called a state transition which, in the trellis diagram, is represented by 
a directed edge (or branch) connecting the starting state to the destination 
state. Each edge is labeled with the code bits that are generated during the 
state transition. The set of allowable states at a given time instant i is called 
the state space of the encoder at time-z, denoted Ei(C). A state < 7 * £ Ei(C) 
is said to be reachable if there exists an information sequence that takes the 
encoder from the initial state <7o to state <ii at time-t. Every state of the encoder 
is reachable from the initial state <tq. In the trellis, every vertex at level-z for 
i £ T is connected by a path from the initial state <r 0 . The label sequence of 
this path is a code sequence (or a prefix of a code sequence). Every vertex 
in the trellis has at least one incoming edge except for the initial state and at 
least one outgoing edge except for a state called the final state. Encoding of 
an information sequence is equivalent to tracing a path in the trellis starting 
from the initial vertex <t 0 . If the encoding interval T is semi infinite, the trellis 
continues indefinitely; otherwise it terminates at a final state, denoted 07 . 
Convolutional codes have semi infinite trellises, while the trellises for linear 
block codes terminate at the end of each encoding interval. 

For i £ T, let Ii and Oi denote the input information block and its corre- 
sponding output code block, respectively, during the interval from time-z to 
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i-th level 



transition 

Figure 3.1. Trellis representation of a finite state encoder. 


time-(i 4- 1). Then the dynamic behavior of the encoder for a linear code is 
governed by two functions: 

(i) Output function, 

Oi = 

where for /, / /'. 

(ii) State transition function, 

<7i+l = fi)i 

where a, € Ej(C) and cr, + 1 6 E i+1 (C) are called the current and next states, 
respectively. In the trellis diagram for C, the current and next states are 
connected by an edge (cr », 1 ) labeled with O 

A code trellis is said to be time-invariant if there exists a finite period 
{0,1 u}cT and a state space E(C) such that 
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(1) Ei(C) C E(C) for 0 < t < v and £i(C) = £(C) for i > v and 

(2) fi = f and g> = g for all : E T. 

A code trellis that is not time- invariant is said to be time- varying. A trellis 
diagram for a linear block code is, in general, time-varying. However, a trellis 
diagram for a convolutional code is usually time- invariant. Figure 3.2 and 3.3 
depict a time- varying trellis diagram for a block code and a time-invariant trellis 
diagram for a convolutional code, respectively. 

3.2 BIT-LEVEL TRELLISES FOR BINARY LINEAR BLOCK CODES 

Consider a binary (AT, K ) linear block code C with generator and parity-check 
matrices, G and H } respectively. During each encoding interval, a message 
of K information bits is shifted into the encoder memory and encoded into a 
codeword of N code bits. The N code bits are formed and shifted onto the 
channel in N bit times. Therefore, the encoding span T is finite and consists 
of N + 1 time instants, 

r = { 0 ,l ,2 

C can be represented by an AT-section trellis diagram over the time span T. Let 
£(C) denote the encoder for C. 

Definition 3.1 An AT-section trellis diagram for a binary linear block code C 
of length N y denoted T, is a directed graph consisting of iV+ 1 levels of vertices 
(called states) and edges (called branches) such that: 

(1) For 0 < i < Ny the vertices at the :-th level represent the states in the 
state space E*(C) of the encoder £(C) at time-:. At time-0 (or the 0-th 
level) there is only one vertex, denoted oo, called the initial vertex (or 
state). At time- N (or the AAth level), there is only one vertex, denoted 
<T/y called the final vertex (or state). 

(2) For 0 < : < AT, a branch in the :-th section of the trellis T connects 
a state <Ti-\ G Ei_i(C) to a state cr^ G £i(C) and is labeled with a 
code bit Ui that represents the encoder output in the bit interval from 
time-(: — 1) to time-:. A branch represents a state transition. 
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(3) Except for the initial state, every state has at least one, but no more 
than two, incoming branches. Except for the final state, every state has 
at least one, but no more than two, outgoing branches. The initial state 
has no incoming branches. The final state has no outgoing branches. 
Two branches diverging from the same state have different labels. 

(4) There is a directed path from the initial state <tq to the final state a/ 
with a label sequence (ui,u 2 , . . . if and only if (ui,U 2 , . . . is a 
codeword in C. 

AA 

Two states in the code trellis are said to be adjacent if they are connected 
by a branch. During one encoding interval T, the encoder starts from the initial 
state <To, transverses a sequence of states 

((To, » * • • , • • * ♦ <7/)» 

generates a code sequence 

and then reaches the final state <t/. The bit-level 8-section trellis diagram 
for the (8,4) linear block code given in Example 2.1 (Table 2.1) is shown in 
Figure 3.2. 

For 0 < i < N, let |E^(C)| denote the cardinality of the state space Ei(C). 
Then, the sequence, 

(|Eo(C)|, |Ei(C)|, . . . , |E„_i(C)|,|E w ((7)|), 

is called the state space complexity profile, which is a measure of the 
state complexity of the ^-section code trellis T. We will show later that for 
0 < i < N, |Ei(C)| is a power of 2. Define 

Pi(C) ± log 2 |Ej(C)|, 

which is called the state space dimension at time-t. When there is no 
confusion, we simply use pi for pi(C) for simplicity. The sequence, 

(p o,Pi>---.Pw), 


DRAFT 


January 6, 1998, 8:40pm 


DRAFT 



TRELLIS REPRESENTATION OF LINEAR BLOCK CODES 29 

is called state space dimension profile. From Figure 3.2, we see that the 
state space complexity and dimension profiles for the (8, 4) code given in Ex- 
ample 2.1 are (1, 2, 4, 8, 4, 8, 4, 2, 1) and (0, 1,2, 3, 2, 3, 2, 1,0), respectively. 

3.3 TRELLIS ORIENTED GENERATOR MATRIX 

To facilitate the code trellis construction, we put the generator matrix G in a 
special form. Let u = (ui , u 2 , . . . , u N ) be a nonzero binary N-tuple. The first 
nonzero component of 1 1 is called the leading ‘1’ of u and the last nonzero 
component of u is called the trailing *1* of t*. 

A generator matrix G for C is said to be in trellis oriented form (TOF) 
if the following two conditions hold: 

(1) The leading ‘1’ of each row appears in a column before the leading *1* 
of any row below it. 

(2) No two rows have their trailing “ones” in the same column. 

Any generator matrix for C can be put in TOF by two steps of Gaussian 
elimination. 

Example 3.1 Consider the (8, 4) RM code given in Example 2.1 with following 
generator matrix, 

' 11111111 ' 

0 0 0 0 1 1 1 1 

0 0 1 1 0 0 1 1 

0 10 10 101. 

It is not in TOF. By interchanging the second and the fourth rows, we have 

'11111111' 

0 10 10 10 1 
0 0 1 1 0 0 1 1 

0 0 0 0 1 1 1 1 . 
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Add the fourth row of the above matrix to the first, second and third rows. 
These additions result in the following matrix in TOF: 


’ 9i ' 


‘ 1 

1 

1 

1 

0 

0 

0 

0 “ 

9 2 


0 

1 

0 

1 

1 

0 

1 

0 

9j 


0 

0 

1 

1 

1 

1 

0 

0 

. 9 a . 


_ 0 

0 

0 

0 

1 

1 

1 

1 M 


AA 

The span of a row g = (< 7 i,p 2 > * • • » <7n) in a trellis oriented generator matrix 
(TOGM) G is defined as the smallest interval {»,i + 1, . . . which contains 
all the nonzero bits of g. This is denoted as span($) ^ [t, j]. For a row g in 
a TOGM G whose span is the active span of g y denoted aspan(<j), is 
defined as aspan(p) = [t, j - 1] for t < j and aspan(<j) = 0 (empty set) for i = ;. 
Let g x ,g 2 ,... >g K be the K rows of a TOGM G with 

9\ = CfiKii <7i2> * • . ,9in) 

for 1 < / < K. Then 



’ 9x 


" 9n 

9\1 

• 9\N 

G = 

9 2 

= 

921 

922 

• 92N 


. 9 k . 


_ 9k\ 

9K2 ■ 

■ • 9KN 


Let (ai,a 2 ,. . . ,a/f) be the block of K information bits (called a message) to 
be encoded. The corresponding codeword is given by 

u = (ui,u 2 ,...,ujv) 

= a \ * 9l + °2 • 92 * * * + a K * 9 K- 

We see that the J-th information bit a/ affects the output u of the encoder 
£ (C) over the span of the Lth row g t of the TOGM G. This span(< 7 j) may be 
regarded as the constraint length of the code associated with the l-th input 
information bit a*. 

At time-i with 1 < i < A r , the number of information bits that affect the next 
output code bit Ui+i is equal to the number of rows in G whose active spans 
contain i. These information bits define the state of the encoder at time-i. 
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3.4 STATE SPACE FORMULATION 

In the following, we give a mathematical formulation of the state space of the 
^/-section trellis for an (iV, K) linear block code C over GF(2) with a TOGM 
G. 

At time-i, 0 < i < N, the rows of G are divided into three disjoint subsets: 

(1) G£ consists of those rows of G whose spans are contained in the interval 

[Ml- 

(2) G{ consists of those rows of G whose spans are contained in the interval 

(3) G\ consists of those rows of G whose active spans contain t. 

Let A ?, A[ and A ■ denote the subsets of information bits that correspond to 
the rows of Gf,G{ and GJ, respectively. The information bits in A f do not 
affect the encoder outputs after time-i, and hence they become the past with 
respect to time-i. The information bits in A[ only affect the encoder outputs 
after time-i. Since the active spans of the rows in GJ contain the time instant 
t, the information bits in A* affect not only the past encoder outputs up to 
time-i but also the future encoder outputs beyond time-i. We say that the 
information bits in A* define a state of encoder £(C) for the code C at time-i. 
Let pi = |AJ| = |GJ|. Then there are 2 P * distinct states that the encoder £(C) 
can occupy at time-:; each state is defined by a specific combination of the pi 
information bits in A * . These states form the state space E;(C) of the encoder 
£ (C) (or simply of the code C). The parameter pi is the dimension of the 
state space E{(C). In the trellis representation of C, the states in Et(^) are 
represented by 2 P * vertices at the i-th level of the trellis. 

Example 3.2 Consider the TOGM G for the (8,4) RM code given in Exam- 
ple 3.1. The spans of the four rows are: span (g^ = [1,4], span(p 2 ) = [2,7], 
span(y 3 ) = [3,6], and span(^ 4 ) = [5,8]. Their active spans are therefore: 
aspan(<7 x ) = [1,3], aspan(</ 2 ) = [2,6], aspan(g 3 ) = [3,5] and aspan( 5 4 ) = [5,7], 
For each i with 0 < » < 8, counting the number of rows which are active at 
time-» yields the state space dimension profile (0, 1,2, 3, 2, 3, 2, 1,0). 

AA 
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The above formulation of a state space actually provides a sequential ma- 
chine model for the encoder £(C). 


3.5 STATE TRANSITION AND OUTPUT 


For 0 < i < N, suppose the encoder £(C) is in state € Et(C). From time-i 
to time-(i -hi), £(G) generates a code bit u*+i and moves from state <T{ to a 
state 6 £j+i(C). Let 


and 


(3.1) 

A ‘ = {o^.o^,. . . ,a£>} (3.2) 


where pi = |GJ|. The current state of the encoder is defined by a specific 
combination of the information bits in A*. 

Let g m be the row in G{ whose leading ‘T is at positional + 1). The unique- 
ness of this row g m (if it exists) is guaranteed by the first condition in the 
definition of a generator matrix in TOF given in section 3.3. Let denote 
the (i -h l)-th component of g m . Then <?* +1 = 1. Let a m denote the informa- 
tion bit that corresponds to row g m . It follows from (2.3) and the structure of 
the TOGM G that the output code bit Uj+i generated during the bit interval 
between time-i and tim e-(i + 1) is given by 


Ui+l 


+£ 

i - 1 




(3.3) 


where g\ l J+ x is the (i + l)-th component of in G*. Note that a* begins 
to affect the output of the encoder £(C) at time-(i + 1). For this reason, the 
bit a* is regarded as the current input information bit. The second term 
in (3.3) is the contribution from the state <t; defined by the information bits 
in i4* = which are stored in memory. From (3.3), we see 

that the current output tii+i is uniquely determined by the current state <j{ of 
the encoder £(C) and the current input a*. The output bit tij+i can have two 
possible values depending on the current input information bit a *; each value 
takes the encoder £(C) to a different state at time-(i + 1). That is, there are 
two possible transitions from the current state <Ji to two states in 
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time-(t +1). In the code trellis, there are two edges (or branches) diverging 
from the vertex <7* labeled with ‘O’ and T\ respectively. 

Suppose there is no such row g* in G [ . Then the output code bit is given 
by 

“i+1 = ( 3 - 4 ) 

i=l 

In this case, we may regard that the current input information bit a* is being 
set to u 0”, i.e. a* = 0 (this is called a dummy information bit). The output 
code bit u»+i can take only one value given by (3.4) and there is only one 
possible transition from the current state <Ti to a state in E*+i(C). In the 
trellis T, there is only one branch diverging from the vertex <t*. 

Example 3.3 Again we consider the (8,4) code with its TOGM G given in 
Example 3.1. Consider time-2. Then we find that G% = 0,(7 { = {^ 31 ^ 4 } an< ^ 
Gj = { 01 , 02 }- Therefore, the information bits ax and define the state of the 
encoder at time-2 and there are 4 distinct states defined by four combinations 
of values of ai and 02 , {00,01,10, 11}. We also see that g~ = <j 3 . Therefore, 
the current input information bit is a* = 03 . The current output code bit u 3 
is given by 


U3 sr <23 + <*1 * 013 + <*2 * 023 

= a 3 + ai. 


For every state defined by and 02 , u 3 has two possible values depending on 
03 . In the trellis, there are two branches diverging from each state at time- 2 , 
as shown in Figure 3.2. 

Now consider time-3. At t = 3, we find that C ? 3 = 0, G 3 = { 0 4 } and 
G 3 = {0i,02i 03 }- Therefore, the information bits ai,a 2 and 03 define 8 states 
at time-3, as shown in Figure 3.2. There is no row g m in G{ with leading *1’ 
at position (or time) i — 4. Hence we set the current input information bit 
a* = 0 . The output code bit u 4 is given by 

W 4 = fll * 014 + * 024 + <*3 * 034 

= Ol + 02 + a 3* 
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In the trellis, there is only one branch diverging from each of the 8 states, as 
shown in Figure 3.2. 


AA 

Let g° be the row in GJ whose trailing ‘1’ is at the position-^ + 1). (Note 
that this row g° may not exist.) The uniqueness of the row g° (if it exists) 
is guaranteed by the second condition of a generator matrix in TOF given in 

Section 3.3. Let a 0 be the information bit in A ? that corresponds to row g°. 
Then at time-(i -Hi), 


(3.5) 

and 

A i+i = (A*\{a°}) U {a*}. ( 3 .6) 

The information bits in Af +1 define the state space E i+: (G) at time-(t + 1). 
The change from A’ to A’ i+1 defines a state transition from the current state 
defined by A’ to the next state a i+l defined by AJ +1 . Therefore from A’, A* +1 , 
(3.3) and (3.4), we can construct the JV-section code trellis T for C. 

The construction of the ^-section trellis T is carried out serially, section by 
section. Suppose the trellis has been constructed up to section-t. Now we want 
to construct the (i + l)-th section from time-i to time-(» + 1). The state space 

£i(C) is known. The (t + l)-th section is constructed by taking the following 
steps: 

(1) Determine G’ +1 and A' l+1 from (3.5) and (3.6). Form the state space 
£;+i(C) at time-(i + 1). 

(2) For each state <r, 6 D,(C), determine its state transition(s) following the 
state transition rules given above. Connect a to its adjacent state(s) in 
£i+i(C) by edge(s). 

(3) For each state transition, determine the output code bit u i+1 from the 

output function of (3.3) or (3.4), and label the corresponding edge in 
the trellis with u,- + j. 


DRAFT 


January 6, 1998, 8:40pm 


DRAFT 



TRELLIS REPRESENTATION OF LINEAR BLOCK CODES 35 


3.6 TIME-VARYING STRUCTURE 


During the encoding interval T = {0, 1 , . . . , N} y the output function of the 
encoder £ (C) changes between (3.3) and (3.4). Also, the set {<7 i*!+i> 92 'i+n • • • > 
gM . +1 } in the summations of (3.3) and (3.4) may change from one time instant 
to another. This is because each column in the TOGM is, in general, not a 
downward shift of the column before it. Therefore, the output function of £(C) 
is time- varying. As the encoder £{C) moves from time-i to time-(t 4- 1), its 
state space may also change, i.e., E*+i(C) ^ Consequently, the trellis 

for £{C) is time- varying. 

To describe the time-varying state space of £ (C), there are four cases to 
consider. 

Case I: There is no such row g° in G\ y but there is a row g* in G[. As 
the encoder moves from time-i to time-(i + 1), the active span of g m 
contains the time instant i + 1. Therefore, g m is added to the set G* 
to form G t - +1 . The information bit a* that corresponds to g m is now 
in the encoder memory and is included in determining the next and 
future states of the encoder. The next state 0^+1 is determined by the 
information bits 


a l' ) > a 2^ > 




Since |GJ +1 | = |G?| + 1, p,+ i = Pi + 1. This results in state space 
expansion. 

Case II: There is a row g° G Gf and a row g m G G{. When the encoder 
moves from time-» to time-(i + 1), the span of g° moves into the interval 
[l,i + 1] and g° is replaced by g m in G- +1 . In this case, the information 
bit a 0 that corresponds to g° becomes part of the past with respect to 
time-(i + 1) and will not affect the encoder outputs further; however, the 
information bit a * is now in the memory and is included in determining 
the next and future states of the encoder. Assuming that a 0 = a [ l) , the 
next state cr.+i of the encoder is then determined by the information 
bits 

,ap-,a\ 

Therefore, from time-t to time-(t + 1), the state space of the encoder 
and its dimension remain the same, i.e., p»+i = Pi- 
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Case III: There is no such row <j° in G\ and no such row g * in G( . In this 
case, G * +1 = G * and there in no change in the state space dimension 
as the encoder moves from time-i to time-(i + 1). The next state is 
determined by the same set of information bits as at time-i, i.e., 




p* 


Case IV: There exists a row g° € G\ but there is no such row g m € G { . In 
this case, g° is excluded from G* to form GJ +1 and its corresponding 
information bit a 0 becomes part of the past as the encoder moves from 
time-i to time-(i +1). Assuming that a 0 = the state <Tj+ 1 of the 
encoder is determined by the information bits 






Consequently, |GJ +1 | = |G'| — 1 and pi+i = pi — 1. This results in state 
space reduction. 


Example 3.4 Consider the (8,4) code given in Example 3.1. Prom its TOGM 
G y we see that for i = 0, 1 and 2, there is no such row g° in G*, but there is 
a row g m in G { . Hence there is state space expansion from time-0 to time-3 
as shown Figure 3,2. We note that there is such a row g° in GJ and there is 
no such row g m in G3 . Therefore, there is state space reduction from time-3 to 
time-4, as shown in Figure 3.2. 

AA 


From the above analysis of the AT-section trellis for an (N y K) linear block 
code G, we have the following observations. At time-t, with 0 < t < AT, 

(1) The information bits in A? become the past and do not affect the future 
outputs of the encoder beyond time-i. 

(2) The information bits in A{ affect the encoder outputs only beyond time- 
i, i.e., they are the future input information bits. 

(3) The information bits in A J are the bits stored in the encoder memory 
that define the encoder state at time-i. 

The above observations make the formulation of a trellis diagram for a block 
code the same as for a convolutional code [62]. 
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3.7 STRUCTURAL PROPERTIES 

For 0 < i < j < N y let Cij denote the subcode of C consisting of those 
codewords in C whose nonzero components are confined to the span of j — i 
consecutive positions in the set {» + 1, * + 2, « . . , j}. Clearly, every codeword in 
Cij is of the form, 

( 0, 0, ■ . ■ , 0 , 1 , it i-f-2 » • • • i Vj i o» 0, • » » t o ) • 

t N-i 

It follows from the definition of Cij and the structure of the TOGM G for 
C that Ci j is spanned by those rows in G whose spans are contained in the 
interval [i + 1, j]. The two subcodes, Co y i and C^x* are spanned by the rows 
in G? and G{, respectively, and they are called the past and future subcodes 
of C with respect to time-i. 

For a linear code Z), let k(D) denote its dimension. Then, fc(Co,i) = \G? \ 
and fc(C*,jv) = |G{|. Recall that the dimension of the state space E»(C) at 
time-* is 

Pi(C) = \G’\ = K-\G p i \-\G{\ 

= K - k{C a<i ) - k{Ci, at). (3.7) 

This gives a relationship between the state space dimension Pi(C) at time-* 
and the dimensions of the past and future subcodes, Co t i and of C with 

respect to time-i. 

Note that C 0f i and C^x have only the all-zero codeword 0 in common. 
The direct-sum of C7 0 ,i and C^Xy denoted C 0 ,i 0 C^x, is a subcode of C with 
dimension 

k{C 0 J + k{C itN ). 

Let C/(C 0 ,i 0 Ci,x) denote the partition of C with respect to C 0 ,i 0 C^x- 
Then this partition consists of 

|C/(Co,i®C ilW )| = 2 jr - k(c ’*- j, - fc(C - Jv) 

= 2 P ‘ (3.8) 

cosets of C 0 ,i © Ci n- Eq.(3.8) says that the number of states in the state space 
E,(C) at time-i is equal to the number of cosets in the partition C j (Cq^SC,^ ). 
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Time 



A coset in 

poAcycfo 


A coset in 
Pi,N(C)/C? <N 


Figure 3.4. The paths in the code trellis that represent the 2 K Pi codewords in v 0 
(C 0 ,i 0 Ci,n). 


Let Si denote the subspace of C that is spanned by the rows in G*. Then 
each codeword in Si is given by 


= K 
_ 


(o „<*> 


2 


G; 




+ a 


(<) 


• g W 

&Pi 


(3.9) 


where a[ l) € A ■ for 1 < / < pi . The 2 pi codewords in Si can be used as 
the representatives for the cosets in the partition C/(C 0 ,i 0 C^n)- Therefore, 
Si is the coset representative space for the partition C/(Co,» 0 From 

(3.9), we see that there is one-to-one correspondence between v and the 
state <Ti € Ei(C') defined by (<4‘ ) ,<4 ,) ,...,<4* ) ). Since there is a one-to-one 
correspondence between v and a coset in G/(Co,t 0 therefore, there is 

a one-to-one correspondence between a state in the state space £t(G) and 
a coset in the partition C/(Cqj 0 Ci y #). 

With the above one-to-one correspondence in the trellis T, the codeword v 
given by (3.9) is represented by a path that passes through the state <Ji defined 
by the information bits, <4° , <4°, Ap) (i.e., a path that connects the initial 
state (T 0 to the final state a f through the state <r;). If we fix the information 
bits, a['\ <4°, ..., ap,\ and allow the other K- pi information bits to vary, we 
obtain 2 K ~ pi codewords of C in the coset 


v © (C 0 ,« © C i>N ) = {v + u : tt € C 0 ,i © C^.w} (3.10) 

with t; as the coset representative. In the trellis, these 2 K ~ pi codewords are 
represented by paths that connect the initial state Tq to the final state o j 
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through the state at time-i defined by the information bits, a^, . . 
a"p } , as shown in Figure 3.4. Note that 

K - pi — fc(Co,») + k{Cij sr) (3.11) 

which is simply the dimension of 0 

For 0 < i < j < N } let Pij{C) denote the linear code of length j - * obtained 
from C by removing the first : and last N — j components of each codeword 
in C . This code is called a punctured (or truncated) code of C. Let 
denote the punctured code of the subcode Cjj, i.e M 


C*i - P*A C i,j)- 

(3.12) 

It follows from the structure of the TOGM G that 


k( PlJ {C)) = K- k(C 0 ,i) - k(C j<N ) 

(3.13) 

and 


*(c&) = HCij). 

(3.14) 


Consider the punctured code Po y i(C). Partition po,i(C) based on Cq\. It 
follows from (3.13) and (3.14) that the partition Po^CJ/Cq^ consists of 

2K-k(Co.i)-kiC itN ) _ 2 p. 

cosets of Cq\. We can readily see that there is a one-to-one correspondence 
between the cosets in po^ty/C^ and the cosets in C/(C 0 ,i®C^tf )» anc * hence a 
one-to-one correspondence between the cosets inpo,»(C)/f?o r i anc * the states 
in the state space E^C). The codewords in a coset in Pq^(C)/Cq T x are simply 
the prefixes of the codewords in its corresponding coset in C/(Co y i ® C^). 
Hence the codewords in a coset of Po,i(C')/Co^ will take the encoder £{C) to 
a unique state <J X € Ei(C). In the trellis T, they are the paths connecting the 
initial state cr 0 to the state <J{ as shown in Figure 3.4. Let L(<7o,<Ti) denote the 
paths in the trellis T that connect the initial state <Tq to the state in E^(C). 
Then L(<7 0> cr,) is a coset in the partition Po,t(^)/CoV 

Now we consider the punctured code Pi,jv(C'). Partition pi y s{C) based on 
C\* N — p»,Ar(C , i t /v). Then it follow from (3.13) and (3.14) that the partition 
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Time 0 

a 

C To 


0 



0 




N 

-o 


Figure 3.5. Paths in the code trellis that represent the codewords in <V 
PiAc)/c\: N consists of 

2 K-k{C 0 t i)-k(C iiN ) 2 P. 

cosets of C\* N . Again we see that there is a one-to-one correspondence 
between the cosets in Pi t N{C)/C±* N and the cosets in C/(C 0 ,i © and 

hence a one-to-one correspondence between the cosets in Pi y N{C)/Cl T N and 
the states in the state space £i(C). In the trellis, the codewords in a coset 
PiAC)lc\' N form the paths that connect a state G £i(C) to the final state 
<jf as shown in Figure 3.4. Let L{(Xi,<jj) denote the paths in the trellis T that 
connect the state <7* € £,(<?) to the final state Gf. Then L{<Ti><Tf) is a coset in 
the partition pi y N(C)/C\*#. 

For 0 < i < j < N y let and crj°* denote two states on the all-zero path 0 
in the trellis T at time-; and time-j, respectively. Let denote the 

paths of length j — t in T that connect to crj°*. Consider the paths in T 
that start from the initial state cr 0 , follow the all-zero path 0 to the state crj°\ 
transverse through the paths in L((7j°\crj 0 *) to the state then follow the 
all-zero path 0 until they reach the final state <7/ as shown in Figure 3.5. These 
paths represent the codewords in the subcode Cij of C. This implies that 

L(a|°V' 0) ) = <?£, (3.15) 

Let v = (t>i, t >2 > ■ . . , vn) be a path in the code trellis T. For 0 < ; < j < N, 
let <r\ v ^ and be two states on the path v at time-i and time-j, respectively. 
Let denote the paths of length j — i that connect to <rj v \ 

Consider the paths in T that start form the initial state 0 * 0 , follow the path v 
to the state cr\ v \ transverse through the paths in L(a\ v \ <r ^ ), then follow the 
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Figure 3.6. Paths in the code trellis that represent the codewords in the coset u 0 Cij. 

path v until they reach the final state 07 as shown in Figure 3.6. These paths 
represent the codewords in the following coset of Ci^: 

v 0 Cij = + (3.16) 

This is a coset in the partition C/Ci^. This implies that L(a\ v \a^) is a coset 
of C\*j in p itj (C ), i.e., 

L{a ( ” ] ,o\ v] ) = pi j{y) + Cjr € Pij(C)/C%, (3.17) 

where pi >; (u) denotes the vector of length j — i obtained from t; by removing the 
first i and last N — j components of v. For any two connected states er* G £*(<7) 
and <j j G Ej(C) with 0 < i < j < N, they must be on a path in the trellis T. 
It follows from (3.17) that 


Ite,*i)€w J -(C)/C&, (3.18) 


Therefore, the number of paths that connect a state a x G £i(C) to a state 
Oj G £j(C) is given by 


... .. J 2« c -», if., 


and <7j are connected, 
and <jj are not connected. 


(3.19) 


For 0 < i < j < k < N, let £j(<7i,<7jfc) denote the set of states in £ ; (C) 
through which the path in L(^,<jjt) connect the state cr* to the state o* as 
shown in Figure 3.7. Let 

o L(<7j,<j k ) = {u o V : u e L(<Ti,<Tj),v € L(ffj,a k )} (3.20) 
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Time 


j 


k 



Figure 3.7. The state set Ey(<7t,<rjk). 

where u o v denotes the concatenation of two sequences u and v. In the trellis, 
L(<Ti,<jj) o L(<jj,<Xk) consists of those paths in L(a{,ak) that connect the state 
Gi to the state <Tk through the state <7j. Then, 

L(ai,<rk)= (J L(<Ti,erj) o L(crj,ffk)- ( 3 - 21 ) 


The above developments give some fundamental structural properties of an 
7V-section trellis T for a linear block code C. 
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4 STATE LABELING, TRELLIS 
CONSTRUCTION PROCEDURES AND 

TRELLIS SYMMETRY 


The construction of a code trellis can be facilitated by labeling the states at 
each level of the trellis. State labeling is also necessary in the implementation 
of a trellis-based decoding algorithm. This chapter presents three methods 
for labeling the states of the TV-section trellis for an (N, K) linear block code. 
The first two methods are based on the information set that defines the state 
space at a particular encoding time instant and the third method is based on 
the parity-check matrix of the code. The first two methods are more efficient 
than the third one for codes with K < N - K; however, the third method is 
more efficient for codes with N - K < K. Based on these labeling methods, 
construction procedures for the N - section trellis for an (N, K) linear block code 
are presented. Also presented in this chapter is the mirror symmetry structure 
of a code trellis. This symmetry structure is useful in decoding. 
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4.1 STATE LABELING BY THE STATE DEFINING INFORMATION 
SET 

In a code trellis, each state is labeled by a fixed sequence (or given a name). 
This can be accomplished by using a if-tuple A with components correspond- 
ing to the K information bits, ax,a 2 ,... ,a/c, in a message. At time-i, all the 
components of A are set to zero except for the components at the positions 
corresponding to the information bits in A\ = {a^a^, . . . ,a^}. Every com- 
bination of the pi bits at the positions corresponding to the information bits 
in A 1 gives the label l(<Ti) for the state defined by the information bits, 
a {i) a {i) a {i) 

Example 4.1 Consider the (8,4) code given in Example 3.1. At time-4, we 
find that A\ = {02,03}. There are 4 states corresponding to 4 combinations of 
a 2 and a 3 . Therefore, the label for each of these 4 states is given by (0, a 2 , a 3 , 0). 

AA 

The construction of the ^-section trellis for an {N,K) linear block code C 
can be carried out as follows. Suppose the trellis T has been constructed up to 
section -z. At this point, G?, AJ and E*(C) are known. Each state a* £ E»(C) 
is labeled by a ff-tuple. The (t + l)-th section is constructed by taking the 
following steps: 

(1) Determine G* +1 and A* +1 from (3.5) and (3.6). 

(2) Form the state space E* + i(C) at time-(* -f 1) and label each state in 
£m(C) based on A* +1 . The state in £»+i(C) form the vertices of the 
code trellis T at the (i + l)-th level. 

(3) For each state <j{ £ £i(C) at time-i, determine its transition(s) to the 
state(s) in £i+i(C) based on the information bits of a m and a 0 . For each 
transition from a state <T{ € Ei(C) to a state <r; +1 £ E;+x(C), connect 
the state <7, to the state 0^+1 by an edge (<7i,<7i+ 1 ). 

(4) For each state transition (cr^er^+x), determine the output code bit u i+ x 
and label the edge (ir^ai+x) with Ui+j. 

Recall that at time-i, there are two branches diverging from a state in E;(C) 
if there exists a current information bit a*. One branch corresponds to a* = 
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Table 4.1. State defining sets and state labels for the 8-section trellis for the (8, 4) linear 
block code. 


i | 

G’ 

a* 

a° 

At 

State Label 

0 

0 

fli 

- 

0 

( 0000 ) 

1 

i9i) 

02 


w 

(01OOO) 

2 


a 3 

- 

{01,02} 

(oia 2 00 ) 

3 

{ 01 , 02 , 03 } 

- 


{01,02,03} 

(oia 2 O 3 0 ) 

4 

{ 02 , 03 } 

o 4 

- 

{02,03} 

(O02O3O) 

5 

{ 02 > 03 ' 04 } 

- 

03 

{02,03,04} 

(0020304) 

6 

{fl2.S<} 

- 

02 

{02,04} 

(O02OCI4 ) 

7 

{9 4} 

- 

O4 

{04} 

(000a 4 ) 

8 

0 

- 

- 

0 

(0000) 


0 and the other corresponds to a* = 1. For the convenience of graphical 
representation, in the code trellis T, we use the upper branch to represent 
a* = 0 and the lower branch to represent a" = 1. If a* is a dummy information 
bit, then there is only one branch diverging from each state in £ t *(C). This single 
branch represents a dummy information bit. Using the above representation, 
we can easily extract the information bits from each path in the trellis (the 
dummy information bits are deleted). 

Example 4.2 Consider the state labeling and trellis construction for the (8, 4) 
RM code given in Example 3.1 whose TOGM G is repeated below, 


" 9i ' 


‘ 1 

1 

1 

1 

0 

0 

0 

0 ’ 

92 


0 

1 

0 

1 

1 

0 

1 

0 

9z 


0 

0 

1 

1 

1 

1 

0 

0 

. 9a . 


_ 0 

0 

0 

0 

1 

1 

1 

1 _ 


For 0 < i < 8, we determine the submatrix G* and the state defining in- 
formation set A\ as listed in Table 4.1. From Af, we form the label for each 
state in Ei(C) as shown in Table 4.1. The state transitions from time-i to time- 
(i + 1) are determined by the change from A* to Following the trellis 
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Figure 4.1. The 8-section trellis diagram for the (8,4) RM code with state labeling by 
the state defining information set. 

construction procedure given above, we obtain the 8-section trellis diagram for 
the (8,4) RM code as shown in Figure 4.1. Each state in the trellis is labeled 
by a 4-tuple. 


In many cases, we do not need K bits for labeling the states of the iV-section 
trellis for a binary (N,K) linear block code C. Let (p 0 ,pi,. > . ,pjv) be the state 
space dimension profile of the trellis. Define 

Pmax(C) = Q m^Pi (4.1) 

which is simply the maximum state space dimension of the trellis. From (3.7), 
we find that p m a.x{C) < K. In general, p max is smaller than K. Since the 
number of states at any level of the trellis is less than or at most equal to 
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Table 4.2. State labeling for the (8,4) RM code using Pmax(C) = 3 bits. 


i | 


State Label 

0 

0 

(000) 

1 


(o lOO) 

2 

{01,02} 

(oia 2 0) 

3 

{ 01 , 02 , 03 } 

( 01 O 2 O 3 ) 

4 

{02,03} 

(02O3O) 

5 

{02,03,0,} 

(020304) 

6 

{02,04} 

(0,040) 

7 

{ 04 } 

(04OO) 

8 

0 

(000) 


p max (C) bits are sufficient for labeling the states in the trellis. Con- 
sider the state space £i(C) at time-t with 0 < i < N which is defined by the set 
{a^a^,. . . of Pi information bits. For each state £ £i(C), we form 

a pmAx{C)’tuple, denoted in which the first pi components are simply 

a [ %) , , . . . , and the remaining p m&x (C) — pi components are set to 0 , i.e., 


l( a <) — ( a i' ) i°2* ) > • • • i a p^»0.0, . . . ,0). 


(4.2) 


Then l(cri) is the label for the state 

Example 4.3 Again we consider the (8,4) RM code given in Example 4.2. 
From the TOGM G of the code, we find the state space dimension profile of 
the 8 -section trellis for the code to be (0, 1 , 2, 3, 2, 3, 2 , 1 , 0). Hence p m * x {C) = 3. 
Using 3 bits for labeling the states as described above, the state labels are given 
in Table 4.2. Compared to the state labeling given in Example 4.2, one bit is 
saved. 

AA 
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4.2 STATE LABELING BY PARITY-CHECK MATRIX 


Consider a binary (N>K) linear block code C with a parity-check matrix 


H = [hi, h 2 ,...,hj,... , 


(4.3) 


where, for 1 < j < N, hj denotes the j-th column of H and is a binary 
(N — iif)-tuple. A binary //-tuple c is a codeword in C if and only if 


cH t = 



N-K 


(4.4) 


where H T denotes the transpose of H. C is called the null space of H. 

Let 0 n-k denote the all-zero ( N — J£)-tuple (0,0, ... ,0). For 1 < i < JV, 
let Hi denote the submatrix that consists of the first i columns of H, i.e., 


= (4.5) 

It is clear that the rank of H, is at most N — K, i.e., 

Rank(Zfj) < N - K. (4.6) 

Then for each codeword c £ CJ^, 

c * Hj = 0 n-k- (4.7) 


Cq { is the null space of H 
Now we consider the partition 

PoAG)/C&. 

Let D be a coset in Pof/Cfc and D ^ Cq { . For every vector a £ D, 

a -Hj = (si,s 2t . ■ • ,sn-k) ®n-k (4.8) 


and is the same for all vectors in D , i.e., for ai,a 2 6 D and a\ ^ a 2 , 

ai * H? = a 2 • Hj = (si,s 2 ,*.*,sn-jc). (4.9) 

The ( N — Zf)-tuple (si, S 2 , . . . , ss-k) is called the label for the coset D. Let 
D\ and D 2 be two different cosets in po^ty/C^. Let a\ £ D\ and a 2 E Di- 
li follows from the theory of linear block codes that ai ^ a 2 and 

a.HT jia 2 .HT. 


DRAFT 


January 6, 1998, 8:40pm 


DRAFT 



STATE LABELING, TRELLIS CONSTRUCTION PROCEDURES 49 

This says that different cosets in Pa,i(C) / have different labels. 

Recall the mathematical formulation of the state spaces of a code trellis. 
There is a one-to-one correspondence between a state a in the state space 
E t (C) at time-t and a coset D € Po,.(C)/ and the codewords of p 0il (C) in 
D form the paths that connect the initial state a 0 to state a. This one-to-one 
correspondence leads to the definition of a state label. 

Let L(a 0 ,a) denote the set of paths in the code trellis for C that connect 
the initial state cr 0 to a state u in the state space E,(C) at time-t. 

Definition 4.1 For 0 < i < N, the label of a state a € E^C) based on a 
parity-check matrix H of C, denoted 1(a), is defined as the binary (N - K)- 
tuple 

l(&) — a ■ Hf = (si,s 2} . . . ,s N _ K ), ( 4 - 10 ) 

for any o 6 L(a 0 ,a). For t = 0, Hi = 0 and the initial state a 0 is labeled with 
the all-zero (AT-AQ-tuple, 0*..*-. Fort = N,L(a 0 ,a f ) = C and the final state 
<jf is also labeled with 

AA 

It follows from the above definition of a state label, the one-to-one correspon- 
dence between the states in E;(C) and the cosets in Po,»(C')/C5 r i for 0 < i < N, 
and (4.10) that every state a € E*(C) has a unique label and different states 
have different labels. 

For Q < t < N, \et cri and <Ti+i be two adjacent states with < 7 { 6 £i(C) 
and <Ji+\ € £i+i(C). Let Uj+i be the label of the branch in the code trellis 
that connects state a i to state ffi+i. The label u,+i is simply the encoder 
output bit in the interval from time-i to time-(t + 1) and is given by (3.3) or 
(3.4). For every path (u u u 2) . . . , u { ) 6 L(<r 0 ,^), the path (ui,ii 2 , . . . , u t+1 ) 

obtained by concatenating (ui,u 2 ,... ,Ui) with the branch u i+1 is a path that 
connects the initial state cr 0 to the state <r <+ 1 through the state Hence, 
(ui,u 2 ,... ,Uj,Ui + i) £ L(c7o,(Tt +1 ). Then it follows from the definition of a 
state label that 


0 — ( U 1 1 u 2 > * • • > 1 
— (^1 , ti 2 , . . . , Uj) * Hi “h 

= l(<7i) + u i+ i-hj +1 . (4.11) 
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Eq.(4.11) simply says that given the starting state labeled /(o^) at time-z and 
the output code bit u,+i during the interval between time-z and time-(z + 1), 
the destination state labeled l(a x+ 1 ) at time-(z + 1) is uniquely determined. 

Now we present a procedure for constructing the N - section trellis diagram 
for a binary (N, K) linear block code C by state labeling using the parity- 
check matrix of the code. Let u = (t«i,U 2 ,.. . ,ujyr) be a binary iV-tuple. For 
0 < z < N, let po,*(tx) denote the prefix of u that consists of the first i 
components, i.e., 

Po,«( u ) = (ui,u 2 ,...,Ui). (4.12) 

Suppose that trellis has been completed up to the z'-th section (or time-z). At 
this point, the rows of the TOGM G in the set G * = {g g %\ . . . x g$} and 
their corresponding information bits . . . , uniquely define a state 

<x t G Let 


- JO . 


tz — a 


(0 Ji) 


•pr + ®2 -g* + 


~ **pi yp i 


Then po,. (u) is a path connecting the initial state <7 0 to the state a x defined by 
a 2 *\ . . . , o^p] • The label of state er * is given by 


K<Ti) =Po,i(u)Hf. 


The construction of the (t + l)-section of the code trellis is accomplished by 
taking the following four steps: 

(1) Identify the special row g m (if any) in the submatrix G{ and its cor- 
responding information bit a*. Identify the special row g° (if any) in 
the submatrix G*. Form the submatrix Gf +1 by including g m in G \ and 
excluding g° from G*. 

(2) Determine the set of information bits, A* +1 = {a[ l+1 \ aj l+1 ^ , . . . , }, 

that correspond to the rows in G- +1 . Define and label the states in 

Ei+i(C). 

(3) For each state <j x € £,(C), form the next output code bit u »+ x from 
either (3.3) (if there is such a row g * in G{ at time-z) or (3.4) (if there 
is no such row g m in G{ at time-z). 
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Figure 4.2. 8-section trellis for (8, 4) RM code with state labeling by parity-check matrix. 

(4) For each possible value of u i+ i (two if computed from (3.3) and one if 
computed from (3.4)), connect the state Oi to the state o ,+ 1 € Ei+i(C) 
with label 

J^i+i) = f( £r «) + u «+i ‘ 1- 

The connecting branch, denoted L(rj,,cr t ±i) , is labeled with Uj+i . This 
completes the construction of the (» + l)-th section of the trellis. 

Repeat the above steps until the entire code trellis is constructed. 

Example 4.4 Consider the (8, 4) RM code given in Example 3.1. This code is 
self dual. Therefore, a generator matrix is also a parity-check matrix. Suppose 
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we choose the parity-check matrix as follows: 

" 11111111 " 
00001111 
0 0 1 1 0 0 1 1 
. 01010101 . 

Using this parity-check matrix for labeling and following the above trellis con- 
struction steps, we obtain the 8 -section trellis with state labels shown in Fig- 
ure 4 . 2 . To illustrate the construction process, we assume that the trellis has 
been completed up to time-3. At this time instant, GJ = {<Ji»<72»03} 

A\ = {ai, 02 , 03 } are known. The eight states in E 3 (C) are defined by the 
eight combinations of oi,a 2 and a 3 . These 8 states and their labels are given 
below: 



states defined 
by ( 01 , 02 , 03 ) 

state labels 

— (°) 
a 3 

(000) 

( 0000 ) 


(001) 

(1010) 


(010) 

(1001) 

J3) 

(011) 

(0011) 

„(«) 

a 3 

(100) 

(1011) 

„<*) 

(101) 

(0001) 

„(6) 

(110) 

(0010) 

JV 

(111) 

(1000) 


Now we want to construct the 4-th section of the trellis up to time-4. At time-3, 
from the TOGM G, we find that g° = g x and there is no such row g* with 
leading T’ at time-4, Therefore, G\ = {s 2 ,0 3 } A \ = {° 2 ,o 3 }. The four 
states in E 4 (C) at time-4 are defined by the four combinations of a 2 and a 3 . 
The four codewords generated by the rows in G* are: 


( 02 , 03 ) 

paths 

(0,0) 

u 0 = (00000000) 

(0,1) 

Ui = (00111100) 

(1,0) 

u 2 = (01011010) 

(1.1) 

u 3 = (01100110) 
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The four paths that connect the initial state ffo to the four states, denoted <7 4°*, 
and in £«(C) are: 

Po,4(“o) = (0000), 

Pm(ui) = (0011), 

P0,4(ti 2 ) = (0101), 

P0,4(«3) = (0110). 

The submatrix H\ is 

' 1 1 1 1 ‘ 

0 0 0 0 
0 0 11' 

.0101. 

Frompo^{uj), with 0 < j < 3 and H 4 , we can determine the labels for the four 
states, ^ 0) ,<T4 1 ) ,o' 4 2) and <r[ Z \ in E 4 (C) which are given below: 



states defined 
by (02,03) 

state labels 

J°) 

^4 

(00) 

(0000) 


(01) 

(0001) 

_( 2 ) 

(10) 

(0010) 

J3) 

(11) 

(0011) 


The four states and their labels are shown in Figure 4.3 at time-4. Now suppose 
the encoder is in the state cr* 5) with label f^ 5 *) = (0001) at time-3. Since no 
such row g m exists at i = 3, the output code bit u 4 is computed from (3.4) as 
follows: 


u 4 = 1 • g\ 4 + 0 • <724 + 1 • <734 

= 1 * 1 + 0 - 1 + 11 
= 0. 


Then the state is connected to the state in E 4 (C) with label 

l(a^) + u t hj = ( 0001 ) + 0 - ( 1011 ) 

= ( 0001 ), 
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Time-3 Time-4 

Figure 4.3. State labels at the two ends of the 4-th section of the trellis for (8, 4) RM 
code. 

which is state a[ l \ The connecting branch is labeled with u 4 = 0. The connec- 
tions from the other states in E 3 (C) to the states in E 4 (C) are accomplished 
in the same manner. 

AA 

State labeling based on the state defining information sets requires K (or 
p m „(C)) bits to label each state of the trellis; however, state labeling based on 
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the parity-check matrix requires N — K bits to label each state of the trellis. 
Therefore, labeling method- 1 is more efficient for codes with K < N — K while 
labeling method-2 is more efficient for codes with K > N — K. 

4.3 STRUCTURAL SYMMETRY 

Consider a binary (N,K) linear block code C with even length N and TOGM 



Si 


9n 

012 

01 AT 

G — 

9 2 


921 

022 

02N 


. 9k . 


_ 9KI 

9K2 * 

9kn _ 


Let T denote the AT-section trellis diagram for C. Suppose the TOGM G has 
the following symmetry property: For each row g in G with span(p) = [a, 6], 
there exists a row g f in G with span (g f ) = [N +1— 6, AT + 1 — a]. With this 
symmetry property in G, we can readily see that for 0 < i < N/2, the number 
of rows in G that are active at time-(N — i) is equal to the number of rows in 
G that are active at time-i. This implies that 

|E*_i(C)| = |Ei(C)| 

for 0 < i < N/2. We can permute the rows of G such that the resultant matrix, 
denoted G\ is in a reverse trellis oriented form: 

(1) The trailing ‘T of each row appears in a column before the trailing ‘1’ 
of any row below it. 

(2) No two rows have their leading “ones” in the same column. 

If we rotate the matrix G r by 180° counter clockwisely, we obtain a matrix G” 
in which the *-th row g” is simply the (iisT -hi — i)-th row g r K+ \^i of G f in 
reverse order (the trailing ‘1 T of becomes the leading ‘l 1 of g" and the 

leading ‘1* of becomes the trailing T of g"). FYom the above, we see 

that G" and G are structurally identical in the sense that 

span(s") = span(gi) 
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for 1 < i < K. Consequently, the AT-section trellis T for C has the following 
mirror symmetry [101]: The last N/2 sections of T form the mirror image 
of the first N/2 sections of T (not including the path labels). 

Example 4.5 Consider the (8, 4) RM code given in Example 4.2 with TOGM 

1 1 1 1 0 0 0 0 ' 

0 10 110 10 
0 0 11110 0 
0 0 0 0 1 1 1 1 . 

We find that span(fl 1 ) = [1,4], span(p 4 ) = [5,8], and^ and g A are symmetrical 
with each other. Row g 2 has span [2, 7] and is symmetrical with itself. Row 
g 2 has span [3,6] and is also symmetrical with itself. Suppose we permute the 
second and third rows of G . We obtain the following matrix in reverse trellis 
oriented form: 

1 1 1 1 0 0 0 0 " 

0 0 11110 0 
0 10 110 10 
0 0 0 0 1 1 1 1 



9 1 


9 2 



9 3 


. 9* . 



Rotating G ' 180° counter clockwisely, we obtain the following matrix: 


' Si' ' 


“ 1 

1 

1 

1 

0 

0 

0 

0 ' 

92 


0 

1 

0 

1 

1 

0 

1 

0 

9z 


0 

0 

1 

1 

1 

1 

0 

0 

Lsi' J 


_ 0 

0 

0 

0 

1 

1 

1 

1 . 


We find that G" and G are in fact identical, not just structurally identical. 
Therefore, the 8-section trellis T for the (8, 4) RM code has mirror symmetry 
with respect to the boundary location 4, the last four sections form the mirror 
image of the first four sections as shown in Figures 3.2 and 4.1. 

AA 


For the case that N is odd, if the TOGM G of a binary (AT, K ) code C has 
the mirror symmetry property, then the last (N — 1)/2 sections of the iV-section 
trellis T for C form the mirror image of the first ( N — l)/2 sections of T. 
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For the case that G" = G, the JV-section trellis T of C has full mirror 
symmetry structure [101]. For N even, the last N/2 sections of T in reverse 
direction (the final state < 7 / is being regarded as the initial state) is completely 
identical to the first N/2 sections of T (including the path labels). The 8-section 
trellis of the (8,4) RM code has full mirror symmetry as shown in Figure 4.1. 
For N odd, the last ( N - l)/2 sections of T in reverse direction are completely 
identical to the first (N - l)/2 sections of T (including the path labels). 


DRAFT 


January 6, 1998, 8:40pm 


DRAFT 





TRELLIS COMPLEXITY 


This chapter is devoted to analyzing the complexity of an AT-section trellis 
diagram for an (N, K) linear block code. Trellis complexity is, in general, mea- 
sured in terms of the state and branch complexities. These two complexities 
determine the storage and computation requirements of a trellis-based decod- 
ing algorithm, such as the Viterbi decoding algorithm. The state complexity of 
a trellis is measured by its state space dimension profile and the branch com- 
plexity is measured by the total number of branches (or edges) in the trellis. 
In Section 5.1, a simple upper bound on the maximum state space dimension 
is derived. It is proved that the state complexity of a linear block code is the 
same as that of its dual code. In Section 5.2, the concepts of a minimal trellis 
diagram and optimum bit permutation in terms of state complexity are intro- 
duced. It is proved that the trellis construction based on a TOGM results in 
a minimal trellis. In Section 5.3, the branch complexity of an iV-section trellis 
diagram is analyzed. Finally, in Section 5.4, the general structure of N - -sect ion 


DRAFT 


January 6, 1998, 8:40pm 


DRAFT 



60 TRELLISES AND TRELLIS-BASED DECODING ALGORITHMS 

trellis diagrams for cyclic codes is given. It it shown that the maximum state 
space dimension meets the upper bound derived in Section 5.1. 


5.1 STATE COMPLEXITY 

For a binary ( N , K) linear block code C, the state complexity of an //-section 
bit-level code trellis is measured by its state space dimension profile 

(Po,Pi,P 2 , • • ■ ,Pn), 


where for 0 < i < N, 


Pi = log 2 |S»(C)|. 


Let p max (C) denote the maximum among the state space dimensions, i.e., 


Pmax(C) = max pi. 

V 7 0<t<iV 

Using the construction method described in Chapter 3, the state space dimen- 
sion at time-* is given by (3.7), 


Pi — K — k(C Qyi ) — k(Ci^), 

for 0 < i < N. Since &(Co,i) and k(C{^) are nonnegative, we have 

Pmaut(C) < K . (5.1) 

However, it follows from (4.6) and the definition and uniqueness of a state label 
at any tim e-i (see (4.10)) that 

|Ei(C)| < 2 n ~ k 


and 

Pi<N-K (5.2) 

for 0 < i < N. Eq.(5.2) implies that 

Pm«(C) <N -K. (5.3) 

Combining (5.1) and (5.3), we have the following upper bound on the maximum 
state complexity: 

Pm a x(C) < min{A r , N - K}. (5.4) 
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This bound was first proved by Wolf [109]. In general, this bound is quite loose. 
However, for cyclic (or shortened cyclic) codes, this bound gives the exact state 
complexity. For noncyclic codes, tighter upper bounds on pmax(C^) have been 
obtained. 

If the Viterbi algorithm is applied to the AT-section trellis of a code, then the 
maximum numbers of survivors and path metrics needed to be stored are both 
2 Pm**(C) Therefore, the parameter p m * x {C) is a key measure of the decoding 
complexity (or trellis complexity). 

For 0 < t < min {AT, AT — AT}, it follows from the structure of a TOGM G 
that the number of rows in G whose active spans contain the time index t is no 
greater than i. For * > max{AT, N — AT}, since there is one-to-one correspon- 
dence between the states in S»(C) and cosets in the partition Pi,N(C)/C\* N , 

Pi = k(p^ N {C))-k{C\\ N ) 

< HPi,N{C)) 

< N-i 

< min{K,N- AT}. (5.5) 

Therefore, for 0 < i < N, we have the following upper bound on pi\ 

pi < min{t, AT, N — K } N — t}. (5-6) 

Let C x denote the dual code of C. Then C x is an (N> N — K) linear block 
code. Consider the N-section trellis diagram for C x . For 0 < i < N> let 
£i(C x ) denote the state space of C x at time-i. Then there is a one-to-one 
correspondence between the states in E i(C x ) and the cosets in the partition 
Po,*(C rX )/C r ^ tr where C^ tr denotes the truncation of in the interval [l,i]. 
Therefore, the dimension of Ei(C x ) is given by 

Pi {C L ) = k(p 0ii (C L )) - k(c^ lr ). (5.7) 

Note that po t j(C' x ) is the dual code of and Cq- 1t is the dual code of 
Po,i(C)- Therefore, 

fc(po,i(C x )) = i-k(C l 0 r) 

= i - k{C 0li ) (5.8) 
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and 


k(C 0 ± f) = i-k(p 0 AC)). (5.9) 

It follows from (5.7), (5.8) and (5.9) that 

PiiC^) = K- k(Co,i) - k(Ci, N ). (5.10) 

From (3.7) and (5.10), we find that for 0 < i < N> 

Pi(C x ) = Pi (C). (5.11) 

This says that C and its dual code C have the same state complexity. 

5.2 MINIMAL TRELLISES 

An Af-section trellis is said to be minimal if the total number of states in the 
trellis is minimum. A minimal trellis is unique within isomorphism [77], i.e., 
two minimal trellises for the same code are isomorphic (structurally identi- 
cal). The above definition of minimality is commonly used in the literature. 
However, a more meaningful and useful definition of minimality of a trellis is 
in terms of its state space dimension profile. An N- section trellis is said to 
be a minimum state space dimension trellis if the state space dimension 
at each time of the trellis is minimum. A more precise definition is given as 
follows. Let T be an N-section trellis for an (N> K) code C with state space 
dimension profile (po,pi,. • « ,p/y). T is said to be minimal if, for any other 
AAsection trellis T f for C with state space dimension profile (p f Q yp[ i . . . , p f N ) y 
the following inequality holds: 

Px < p'i, 

for 0 < i < N. 

Suppose a minimum state space dimension trellis T exists. Then, it is clear 
that T is a minimal trellis in total number of states. The formulation of state 
spaces given in Section 3.4 results in a minimum state space dimension trel- 
lis (or minimal trellis) for an (AT, K) linear block code. This will be proved 
in Theorem 5.1. This says that a minimum state space dimension trellis T 
exists for any linear block code C. From the uniqueness of a minimal trellis 
in total number of states within graph isomorphism, the minimal trellis is a 
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minimum state space dimension trellis. This gives the equivalence between the 
two definitions of minimality of a trellis for a linear block code. 

Theorem 5.1 Let C be a binary (N, K) linear block code with trellis oriented 
generator matrix G. The N-section trellis T for C constructed based on G is a 
minimum state space dimension trellis. 

Proof: We only need to prove that for 1 < i < N> the number of states, 2 pi , 
at time-i in the trellis T is minimum over all the trellises for C, where 


Pi = k - k(c 0 ,i) - k(Ci, N y 

Let C* denote the linear subcode of C that is spanned by the rows in the 
submatrix G* of G. Then \C*\ = 2 pi . For two different codewords ti and v in 
C- y it follows from condition (1) of a TOGM that 

Po,i(u) #Po,i(w). 

This implies that 

|{po.i(tt) : u G Cl) I = 2 pi . 

Suppose there is a trellis T* for C whose number of states at time-i is less 
than 2 pl . Then, there must be two different codewords u and v in C* such 
that: (1) there are two paths connecting the initial state to a state a at time-i 
in T ' whose label sequences are po,i(ti) and po,t(t;), respectively; and (2) there 
is a path connecting the state a to the final state in T f whose label sequence is 
Pi y w (u). Without loss of generality, we assume u ^ 0. 

Let ti' denote the binary W-tuple such that po 1 «(u f ) = Po,i(v) and pj ( yv(u') = 
p ii jv(u). Since u' is a path in T r connecting the initial state to the final state, 
it follows from condition (4) of Definition 3.1 of an 7V-section trellis for a linear 
block code that u* is a codeword in C. Therefore, tt 4- u' 6 C. Note that 
Po,i(u + u / ) = po,«(tt) +Po,i(v) # 0 andp i ,jv(u + u / ) = p i>w (u) +p i , JV (tx') = 0. 
This implies that ti + u 9 € Co,*. There are three cases to be considered: 

(1) Suppose u f G Co,». This implies that ti € which is a contradiction 
to the hypothesis that u 6 C* and u^O. 

(2) Suppose v! G C*. This implies that u + u # G Since ti + u* ^ 0, 

ti + ti' can not be in both C* and Cqj. This results in a contradiction. 
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(3) Suppose u 9 € This implies that u € Co,* ® which is not 

possible. 

Therefore, v + u 9 can not be a codeword in C. This results in a contradiction 
to our earlier hypothesis that there exists an IV-section trellis T' for C whose 
number of states at time-* is less than 2*. Therefore, the hypothesis is invalid 
and 2 Pi gives the minimum number of states at time-* for 1 < t < N. 

AA 


It follows from Theorem 5.1 that Eq.(3.7) gives the minimum state space 
dimension p\ with 0 ^ i ^ ^ fur an iV-section trellis for an (IV, K ) linear block 
code. From (3 7), we see that the state space dimension pi at time-* depends 
on the dimensions of the past and future codes, Co,, and . For a given 
code C, k{C 0 ,i) and k(C i}N ) are fixed. 

Given an (N,K) linear block code C, a permutation of the orders of the bit 
(or symbol) positions results in an equivalent code C' with the same weight 
distribution. Different permutations of the bit positions may result in different 
dimensions, k{C 0 ,i) and k{C it N ), of the past and future subcodes, C 0 ,i and 
Ci,jv, and hence different state space dimensions at time-*. A permutation 
that yields the smallest state space dimension at every time of the code trellis is 
called an optimum permutation (or bit ordering). It is clear that an optimum 
permutation reduces the state complexity and is often desirable. Optimum 
permutation is hard to find, however optimum permutations for RM codes 
are known [45] but they are unknown for other classes of codes. 

5.3 BRANCH COMPLEXITY 

The branch complexity of an N-section trellis diagram for an ( N,K ) linear 
block code C is defined as the total number of branches in the trellis. This 
complexity determines the number of additions required in a trellis-based de- 
coding algorithm to decode a received sequence. 

Consider the N-section trellis diagram T for C which is constructed based 
on the rules and procedures described in Chapters 3 and 4. Recall that at 
time-* with 0 < t < N, there are two branches diverging from a state in Ei(C) 
if there exists a row g‘ in G { ; and there is only one branch diverging from a 
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state in £i(C) if there exists no such row g m in G[. Define 

1. if 3* i G{, 

’ \ 2, if g" G g{. 

Let E denote the total number of branches in the N - section trellis T. Then 


(5.12) 


i=0 

= £ 2 Pi ■ Ii(g-). (5.13) 

«=o 

Example 5.1 Again we consider the (8,4) linear block code given in Exam- 
ple 3.1. From Table 4.1, we find that 


lo(g-) = h(9 m ) = W) = Mfl*) = 2 


and 

W) = h{9l = W) = W) = i- 

The state space dimension profile of the 8- section trellis for the code is (0, 1, 
2, 3, 2, 3, 2, 1, 0). From (5.13), we have 

E = 2° • 2 + 2 1 • 2 + 2 2 • 2 + 2 3 • 1 + 2 2 • 2 + 2 3 • 1 + 2 2 - 1 + 2 1 • 1 

= 2 + 4 + 8-b8 + 8 + 8 + 4 + 2 

= 44. 


AA 

An TV-section trellis diagram for an (N,K) linear block code is said to be 
a minimal branch (or edge) trellis diagram if it has the smallest branch 
complexity. A minimal trellis diagram has the smallest branch complexity [69]. 
Branch complexity also depends on the bit ordering of a code. Proper permuta- 
tion of the bit positions of a code may result in a significant reduction in branch 
complexity. A permutation that results in minimal branch complexity is called 
an optimum permutation. From (5.13), we can readily see that a permutation 
which minimizes each product in the summation of (5.13) is a minimal edge 
trellis diagram. A good permutation in terms of branch complexity should have 
the following property: for 0 < i < N y when pi is large, /<(</*) should be equal 
to 1. 
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5.4 TRELLIS STRUCTURE OF CYCLIC CODES 

Consider an {N y K) cyclic code C over GF(2) generated by the following poly- 
nomial [62], 

g(X) = 1 + g\X + g^X 2 + • • • + gs-K^X**-*-' + X N ~ K t 

where for 1 < i < N - AT, $ € GF(2). A generator matrix for this cyclic code 
is given by 

1 0i 02 0 jv-jc-i 1 0 0 0 

0 1 0i 02 9N-K - 1 10 ••• 0 

(5.14) 

00—0 1 g x 02 • ••• 0N-tf-i 1_ 

The A" rows of G are simply the K cyclic shifts of the first row. This generator 
matrix has the following properties: 

(1) It is in trellis oriented form. 

(2) For 1 < z < K , the span of the i-th row g i is 

span(^) = [t,N- K + i]. 

(3) The active spans of all the rows have the same length, N - K. 

Now we consider the bit-level trellis structure for this ( N , K) cyclic code. 
There are two cases to be considered: K > N - K and K < N - K. Consider 
the case for which K > N -K. For 1 < i < N - K , the number of rows whose 
active spans contain the time index i is z. These rows are simply the first z 
rows. For N — K < i < K y the number of rows whose active spans contain the 
time index t is N — K. For K < i < N y the number of rows whose active spans 
contain the time index-z is N — i. Since z > K , 

N - i < N - K. 

From the above analysis, we see that the maximum state space dimension is 
Pmax(C) = N — K and the state space profile is 

(0,1,...,^-^— 1,JV — AT,..., Af-tf, N- K- 0). 
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Now consider the second case for which K < N — K . For 1 < * < Ky the 
number of rows whose active spans contain the time index t is i (the first * 
rows). For K < i < N — K y the number of rows whose active spans contain the 
time index i is K. For N - K < i < JV, the number of rows whose active spans 
contain i is N - *. Since i> N - K, N -i <K. FVom the above analysis, we 
find that the maximum state space dimension is 

Pmax(^) ~ Ky 

and the state space dimension profile is 

(0,1 iif — 

Putting the results of the above two cases together, we conclude that for an 
(N, K) cyclic code, the maximum state space dimension is 

Pm»x(C) = min {K,N - K}. 

This is to say that a code in cyclic form has the worst state complexity (i.e., it 
meets the upper bound on the state complexity). 

The generator polynomial g(X) of an {N t K) binary cyclic code C divides 
X N + 1 [62]. Let 

X" + 1 = g(X)h(X). 

Then h(X) is a polynomial of degree K of the following form: 

h{X) = 1 + hxX + h 2 X 2 + • • • + h K - 1 X K ~ 1 + X K 

with hi E GF(2) for 1 < i < K. This polynomial is called the parity-check 
polynomial. The dual code C 1 of C is an (AT, N-K) cyclic code with generator 
polynomial 

X K h{X - 1 ) = l + h Ar -iX + -**+/i 1 X K “ 1 

The generator matrix for the dual code is 

1 hfc-i hx-2 * 0 0 

0 1 hx-ihx-2 1 0 •••0 

H= 0 0 1 hK-xhK-i 1 * * * 0 (5.15) 


[o 0 0 1 hx-ihx-2 lj 
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which is in trellis oriented form. The trellis structure for C 1 - can be analyzed 
in the same manner as for C. 

The trellis of a cyclic code has mirror symmetry, i.e., the right-half and 
left-half of the trellis with respect to the center are structurally identical. 

To reduce the state complexity of a cyclic code, a permutation of the bit 
position is needed [45, 46]. 

The branch complexity of the N-section trellis diagram T constructed based 
on the TOGM G given by (5.14) can be evaluated easily. First we note that 


I ( *) = { 2 ’ for 0 ^ * < K > 
| 1, otherwise. 


Suppose K < N — K. Then branch complexity of the trellis T is 
K - 1 K - 1 

E = 2* • 2 + {N — 2K) ■ 2 k + 2 K ~' ■ 1 


t=0 


i=0 


= 2 • (2* - 1) + {N - 2K) ■ 2 k + 2 - (2* - 1) 

= (N - 2K + 4) • 2 k - 4. 


(5.16) 


For K > N — K, we have 

N-K-l 


E = JZ 2‘ • 2 + 2 • (2K — N) ■ 2 N ~ K + £ 2 N ~ K ~ i 


*=0 


N-K-l 

£ 

i=0 


= (AK — 27V + 4) • 2 


N-K 


-4. 


(5.17) 


5.5 TRELLISES FOR NONBINARY LINEAR BLOCK CODES 

The methods for constructing trellises for binary linear block codes can be gen- 
eralized for constructing trellises for nonbinary linear block codes with symbols 
from GF(<j) in a straightforward manner. The symbol-level iV-section trellis 
diagram for an (N y K) linear block code C over GF(q) has the following basic 
properties: (1) every branch is labeled with a code symbol from GF(<y); (2) 
except for the initial state, every state has at least one, but no more than q y 
incoming branches; (3) except for the final state, every state has at least one, 
but no more than q , outgoing branches; and (4) the initial state has no in- 
coming branch and the final state has no outgoing branch. In the definition of 
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a trellis oriented generator matrix, the leading u l" and trailing “1* of a row 
are replaced by leading and trailing w nonzero components’*, respectively. The 
maximum state space dimension p max (C) of the minimal Af-section trellis for 
C is upper bounded by 

Pm**(C) < min{K,N - K}, 

and the maximum number of states, |E(C)| roM . is upper bounded by 

|E(C)U» < q roi “ {,C ’*- ,r ‘- 

For Reed-Solomon (RS) codes over GF((j), the above equalities hold, i.e., 

pm*x(C) = min{R' 1 N - K}, 

and 

|E(C)| ro „ = 
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6 


TRELLIS SECTION ALIZATION 


So far, we have only considered bit-level AT-section trellis diagrams for linear 
block codes of length N. In a bit-level trellis diagram, every time instant in 
the encoding interval F = {0, 1, 2, . . . , N} is a section boundary location and 
every branch represents a code bit. It is possible to sectionalize a bit-level 
trellis with section boundary locations at selected instants in the encoding 
interval T. This sectionalization results in a trellis in which a branch may rep- 
resent multiple code bits and two adjacent states may be connected by multiple 
branches. Proper sectionalization may result in useful trellis structural prop- 
erties and allow us to devise efficient trellis-based decoding algorithms. This 
chapter is devoted in analyzing sectionalized trellis diagrams for linear block 
codes. Section 6.1 presents the concepts and rules for trellis sectionalization. 
In Section 6.2, the branch complexity and state connectivity are analyzed and 
expressed in terms of the dimensions of codes related to the code being con- 
sidered. In Section 6.3, construction of a sectionalized trellis diagram for a 
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linear block code based on the trellis oriented generator matrix is presented. 
Section 6.4 studies the parallel structure of a sectionalized trellis diagram. 

6.1 SECTIONALIZATION OF A CODE TRELLIS 

For a positive integer L < N y let 

U±{h Qj h u h 2y ... y h L } (6.1) 

be a subset of L + 1 time instants in the encoding interval T = {0, 1, 2, . . . , N} 
for an (N y K) linear block code C with 0 = ho < hi < h 2 < • • • < h L = N. 
An L-section trellis diagram for C with section boundaries at the locations 
(time instants) in t/, denoted T(C/), can be obtained from the N-section trellis 
T by: (1) deleting every state in E/j(C) for h € {0, 1 , . . . , N} \ U and every 
branch to or from a deleted state, and (2) for 1 < j < L, connecting a state 
a € Ehj-t to a state a 1 G E^ by a branch with label a if and only if there is a 
path with label a from state a to state a f in the N - section trellis T. In an L- 
section trellis with boundary locations in U = {h 0 ,hi,... ,/*£,}, a branch from 
a state in E^_ X (C) to a state in E hj(C) represents ( hj - h>-i) code symbols. 

A subgraph of a trellis diagram is called a subtrellis. The subtrellis of 
T(U) which consists of the state space E/* i _ l (C r ) at time-hj_i, state space 
Eh^C) at time-/ij, and all the branches between the states in Eh J _ 1 (C) and 
E^C), is called the j-th section of T(U). The length of the j-th section is 
hj — hj- 1 . If the lengths of all the sections of an L-section code trellis T(U) 
are the same, T(U) is said to be uniformly sectionalized. In an L-section 
trellis diagram with L < JV, two adjacent states may be connected by multiple 
branches (called parallel branches) with different labels. 

Let phj — log 2 |E h> (C)| be the dimension of the state space E^(C) at time- 
hj. Then 

{POiPh^Phty * • • ,Ph L - X ,PN) 

is the state space dimension profile of the L-section code trellis T(U) with 
section boundary set U = {0, hi, /i 2> . . . , hi-\ , N}. FVom (3.7), we have 

PHj = K - k(C 0ihj ) - k(C hjiN ) . (6.2) 

If we choose the section boundaries, U = {ho, hi, . . . , hj,}, at the places where 
• • • yPh L - 1 are smcill, then the resultant L-section code trellis T(U) has 
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Figure 6.1. A 4-section trellis for the (8,4) RM code. 

a small state space dimension profile. The maximum state space dimension is 

pL,max(C) — max ph.' (6.3) 

0<j<L 

In implementing a trellis-based decoder, such as a Viterbi decoder, a proper 
choice of the section boundary locations results in a significant reduction in 
decoding complexity. 

Example 6.1 Again, we consider the (8,4) RM code given in Example 3.1 
whose 8-section trellis diagram is shown in Figure 3.2 (or Figure 4.1). Suppose 
we choose L = 4 and the section boundary set U = {0,2, 4,6, 8}. Follow- 
ing the above rules of sectionalization of a code trellis, we obtain a uniform 
4-section trellis diagram as shown in Figure 6.1, in which every branch repre- 
sents 2 code bits. The state space dimension profile for this 4-section trellis 
is (0,2,2, 2,0) and the maximum state space dimension is p 4 , ma x(C) = 2. It 
is a 4-section, 4-state code trellis. From Figure 6.1, we notice that the right- 
half of the trellis (the third and fourth sections) is the mirror image of the 
left-half of the trellis (the first find second sections). This mirror symmetry 
allows bidirectional decoding. Furthermore, the code trellis consists of two 
parallel and structurally identical (isomorphic) subtrellises without cross 
connections between them. This parallel structure allows us to devise two 
identical 2-state (Viterbi) decoders to process the trellis in parallel. The mirror 


DRAFT 


January 6, 1998, 8:40pm 


DRAFT 


74 TRELLISES AND TRELLIS-BASED DECODING ALGORITHMS 


symmetry and parallel structure not only simplify the decoding complexity but 
also speed up the decoding process. For large code trellises, these structural 
properties are very important in IC (integrated circuit) implementations. 

AA 

An L-section trellis diagram obtained from a minimal iV-section trellis dia- 
gram by deleting states and branches at places other than the section boundary 
locations is minimal, i.e., it is a minimal L-section trellis diagram for a given 
section boundary set U. 

6.2 BRANCH COMPLEXITY AND STATE CONNECTIVITY 

Consider the j-th section of a minimal L-section trellis diagram T(U) with 
section boundary set U = {ho, hi,... ,h^} for an (N> K) linear code C. The 
boundaries of this section are hj_i and hj. Each branch in this section is 
labeled with hj - h ; _ x bits. Let a and be two adjacent states in the state 
spaces ( C ) and E^ {C ) , respectively. Let L(<r, <r') denote the set of parallel 

branches connecting a and a'. Let L(<Jq,<7) denote the set of paths connecting 
the initial state a Q to the state a. Sometimes, it is convenient to regard the 
parallel branches, L^o-'), between two states as a single branch. This single 
branch is called a composite branch, and L(cr y c r') is called a composite 
branch label. 

The branch complexity of a trellis section is measured by: (1) the size of a 
composite branch; (2) the number of distinct composite branches in the trellis 
section; and (3) the total number of composite branches in the trellis section. 
The overall branch complexity of the trellis is then the sum of the trellis section 
branch complexities. These three branch complexity parameters can be ex- 
pressed in terms of the dimensions of Ch } ,N, and Phj^^hjiC) 

which can be obtained from the TOGM G of C. 

Let (Th^i and a ^ be two adjacent states with and 

£ It has been shown in Section 3.7 that the parallel branches in 

L(<7h j _ 1 ,(7/ >> ) form a coset in the partition 


Phj-uhjity/Chj^hj- 
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Therefore, the number of parallel branches between two adjacent states 
and an, in the j-th section of T(U) is 

|^ hi _ l ,a h> )| = 2 t(Cfc >-‘, ) . (6.4) 


In Section 3.7, we have also shown that 

L{a 0> <r hj _ x ) ep 0 ,h J . l (C)/C l 0 \ hj _ i (6.5) 

and 

&hj ) ^ P0yhj{C)/CQ h .. ( 6 . 6 ) 

If and <7^^ are adjacent, then 

L{a^(T hj _ x ) o L{a h ._ iy a h .) = 

{u 0 V : u € L((7Q y Chj-t) and v € (6.7) 


is the set of paths in T(U) that diverge from the initial state <To, converge 
at the state and then transverse the parallel branches in L(a/i j ._ 1 ,cr^ j ) 

to the state as shown in Figure 6.2. It has also been shown in Sec- 
tion 3.7 that £(<70,0^^) o L(0 , / l> _ 1 , <Jh } ) is a subcode of a coset in the par- 
tition Po,hj (C)/C,5 r v Let denote the set of states in the state space 

Ehj.^C) that are adjacent to state 07^ as shown in Figure 6.2. Then 

[J L[<To,<Th,_ x ) 0%,.,,^) =L{<r 0 ,^hj) (6.8) 


is a coset in the partition po.hj (C)/Cq^. 

Note that the dimensions of L(0* o , cr^_J, L{(Th 3 ^ l ^hj) and L(a 0,0^) are 
k(C' 0 \. J, k(CfJ. , h> ) and ^(^07^), respectively. It follows from (6.8) that 
the number of states in x (07^) ls given by 


l*V.K)| 


2 fc(C" h .)-t(C" hi _ l )-t(Ci'_ l . Jk .) 

2 HCo, h} )-k(C 0 , hj _ l )-k(C kj _ l . hj ) 


(6.9) 


This implies that the number of composite branches converging into a state 
a hj € %hj(C)> called the incoming degree of <7^ , is given by 

degto-hjin = 2 fc(Co,, > ) ' fc(Co ^-> )_fc(C ' , 7- 1 *> ) . (6.10) 
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Time-/ij_i 


Time-hj 


Figure 6.2. State connectivity. 


This number is a measure of the state connectivity of the sectionalized code 
trellis T(U). In an IC implementation of a Viterbi decoder, this number is 
known as the radix number, a key design parameter. 

Since each composite branch L(<Tf l ._ l ,crt l .) in the j - th section of T(U) is a 
coset in the partition pn } _ lf h i {C)/C^ h y the number of distinct composite 
branches in the j - th section of T(U) is 

( 6 . 11 ) 

It follows from (6.2) and (6.10) that the total number of composite branches in 
the j-th section of T(U) is given by 

2 K-k(C 0 ' hj _ l )-k(C hjtN )-k(C ttj _ l 'i tJ ) /g 


FVom (6.11) and (6.12), we find that each distinct composite branch appears in 
the j-th section of T(U) 

2K-*{c 0 , hj _ l )-k{c hitN )-k{ Pkj _ lthj (c)) ( 6 13 \ 
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times. 

From (6.2) (with hj replaced by hj-i) and (6.12), we can compute the 
number of composite branches diverging from a state € E/ l ^_ l (C) at 

time-hj-x as 

(6.14) 

which is called the outgoing degree of <7^^, denoted deg(<7/, i _ 1 )out- Equa- 
tions (6.4), (6.10)-(6.12) and (6.14) give the branch complexity and state con- 
nectivity of the j-th section of a minimal L-section trellis T{U) with section 
boundary locations in U = {/io, hi, * • • , hi}. 

Define 

Sj = log 2 deg(cr h) )i a 

with <Th 1 € E^(C). The ordered sequence (^ 1 ,^ 2 , • • • ,6l) is called the con- 
verging branch dimension profile (CBDP). Define 

A ; = log 2 deg^hjout. 

The ordered sequence (A 0 , Ai, . . . , A l _i) is called the diverging branch di- 
mension profile (DBDP). 

Let Mj denote the total number of composite branches in the j-th trellis 
section (given by (6.12)) and define 

Pj - l°g2 

The ordered sequence 

is called the branch complexity profile (BCP). The branch complexity of 
the minimal L-section trellis T(U) in terms of the total number of branches in 
the trellis is given by 

L 

B = ]T 2^ • 2 k[Ch >-' V. (6.15) 

;=i 

Since each branch in the j-th section of T(U) represents hj - hj^i code bits, it 
is equivalent to hj - hj_i branches in the bit-level AT- section trellis T for the 
code. Therefore, the branch complexity in terms of bit branches is given by 

L 

E = ~ h i- 1 ) ' ' 2 k(C ' k ->- , fc > ) . ( 6 . 16 ) 

j= i 
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If the section boundary is U = {0, 1, 2, , N} y then (6.16) gives the branch (or 
edge) complexity of the bit-level ^-section minimal trellis of the code. 

6.3 A PROCEDURE FOR CONSTRUCTING A MINIMAL L-SECTION 
TRELLIS 

A minimal L-section trellis diagram for an (iV, K) linear block code C can be 
constructed directly from the TOGM G. Let U = {ho, hi, ... , hi} be the set of 
section boundary locations with h 0 = 0 < hi < • • • < hi- \ < hi = N. Again 
the construction of the minimal L-section trellis diagram T(U) with section 
boundary set U is carried out serially, section by section. Suppose T(U ) has 
been constructed up to the j-th section (i.e., up to time-fy) with 1 < j < L. 
Now we begin to construct the (j + l)-th section from time-hj to time-hj+i. 
Partition the rows of the TOGM G into three disjoint subsets, G]J , G£ , and 
G£ as follows (also shown in Figure 6.3): 

(1) G£ consists of those rows in G whose spans are contained in the interval 

[1, hj\. 

(2) G^ consists of those rows in G whose spans are contained in the interval 

[h/+l,N]. 

(3) G 9 h . consists of those rows in G whose active spans contain the time 
index hj. 

It is clear that G^ and G generate the past and future codes, Go,^ and 
Ch }J Ny respectively. Let A B h be the set of information bits that correspond 
to the rows of G 9 h . Then the bits in A 9 h define the state space E/ lj (G) at 
time-/ij. That is, for any binary phj = |A£j-tuple, which represents values of 
information bits in A a h , there is a corresponding state in E^C). 

To determine the composite branches between states in E^(G) and states 
in Eh J+l (C) and the parallel branches between two adjacent states, we further 
partition the rows of G^. into three subsets, Gj[^ +i , and ^ 

follows (see Figure 6.3): 

(I) consists of those rows of G £ whose spans are contained in the 

interval [A, -|- l,hj+i]. 
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Time 


0 hj hj+i N 



(2) G^ s h +i consists of those rows of G whose active spans contain the 
time index hj+\. 

(3) The remaining rows in G £ form 

Let Al' P u and A( ,s k denote the subsets of information bits that corre- 

n ji n j+i 

spond to the rows in G f h ,p hj+i and G{£ hj+1 , respectively. Then the information 
bits in these two sets may be regarded as the current input information bits. 
These input bits together with the state of the encoder at time-hj uniquely 
determine the output code bits between time-fy and time-/»j + i. Note that 
the information bits in A(' p . only affect the output during the interval be- 
tween time-/» ; and time-fy+i. Therefore, they determine the parallel branches 
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between two adjacent states. The information bits in determine the 

diverging composite branches from a state in 

Let p h>> fc i+l (Gg h . +l ), Pfc Jt fc J+1 (Oj[;* hi+l ) and Ph ., hi+1 (GJ.) denote the trun- 
cations of G{’ p k » G{’ J h and Gi from time-fr, to time-/i,+i. The rows in 

SP™ the code the rows in PS.S + .( G fc% +l )* 

p hj ,h j+l ( G £fc i4 , ) “ d ( G U span the truncated codep hi ,fc J+l (C). Then 
every composite branch between a state <7^ € (G) and a state € 

E hj+1 (C) is a coset in the partition Ph i ,h i ^ l {C)l C h i ,h i+l - The number of par- 
allel branches between two adjacent states is therefore |Cj£ jhi+1 |. 

Let <Jh, be the state at time-/i ; defined by the binary p/,, -tuple formed by 
the binary information bits in the set A' h , , 


(a[ j) A i] «£>.), (6.17) 

where p hj = log 2 |E h> (C)| = | G%.\. Let ,5^} denote the rows 

in GJ,. Then 


u 




(;) Jj) 


sr +«2 -*2 + 


M) 

Ph, 


1 U) 

’Phi 


(6.18) 


is a codeword (or path) passing through the state <r h } at time-fy. Let phj,h j+1 (ti) 
denote the branch on u from time-/i ; * to time-Ziy+i. Let B} lj ,h 3 + l denote the 
code of length h j+i -hj generated by Ph i ,S+.( G h’% >+1 )- Then for ever y vector 
b € there is a composite branch diverging from the state cr^ j which 

consists of the following parallel branches, 

{ Phj,hj+i (u) + 6 + x : x £ C%. h . +1 }. (6.19) 


Therefore, the number of composite branches diverging from the state cr^ } is 

|£h i ,h i + 1 |. 

Next, we analyze the state transitions. Partition the matrix G 9 h into two 
submatrices, Gj£ hj+1 and Gj£ hj+l , where 

(1) G a ^ p h t consists of those rows in G 9 h . whose active spans do not contain 
the time index hj+ 1 , and 

(2) Gi' s u consists of those rows in Gi whose active spans contain the 

V t " + 1 3 

time index hj+i- 
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Let A*^ p h +i and denote the sets of information bits corresponding 

to ^A* f h i+1 » respectively. Then the set of information bits that 

defines the state space E/» i+1 (C) at time-hj+x is given by 




- {Ah, \ A‘^ hj+l ) U A f h ’* h . +i 

= A%' hj+i U A f h '* hi+i . 


( 6 . 20 ) 


Therefore, the state transitions from t\me-hj to time-hj+i are completely spec- 
ified by the change from A 9 h > to A 9 h . 

Define 


PhiMi+i ~ “ \ G h‘,hj 


( 6 . 21 ) 


Then it follows from (6.20) (also Figure 6.3) that 


\ A hj,h )+i 


i = Ph>- phj,h j+l > 


( 6 . 22 ) 


and 


\A f h %J =P'> J+ . -Ph„h 1+l - 


(6.23) 


Let a° h be the binary (ph } — P^,h J+1 )-tuple formed by the binary information 
bits in the set A^ p h , be the binary Ph^+j-tuple formed by the infor- 
mation bits in the set , and a m h . be the binary (ph >+1 — 

formed by the information bits in the set Af^ a h Then (a® , aj^) defines a 
state, denoted cr(a° . , ), in the state space £ h 3 (C) at time-hj, and (a^ , a m h . ) 

defines a state, denoted a£ ), in the state space Eh i+1 (C) at time-h ;+1 . 

State tr(a£ ,a^) is adjacent to state ff(fl^,aj.). The composite branch that 
connects these two states in the trellis is given by (6.19) with 


b = c- hj -p hj ,h J+l (G f hX +l )- (6.24) 

Note that these two states share a common a ^ . For ^ , the state cr(a^ , 

) at time-hj is not adjacent to the state <r( a' h , aj^ ) at time-hy+i . Therefore, 
from each state a(a° hjy a^ i ) in E^C), there are 2 Ph ;+i possible tran- 

sitions to the states < 7 ( 0 ^ ,aj.) in 2/» i+1 (C) with € {0, l} Ph i+ l “ Ph i**i+» . 
This completely specifies the state transitions from time-fy to time-h ;+1 . 

State labeling based on the state defining information set A 9 h with 0 < j < L 
is exactly the same as described in Section 4.1. We may use either a K - tuple 
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or a PL.maxC^J-tuple to label a state. In general, PL,m*x(^) is much smaller 
than K y and hence using a Pt.max^-bit label for a state is more efficient. 

Suppose an Irsection trellis diagram T(U) with boundary location set U = 
{ho, hi, ... , h^} has been constructed up to time-hy. The trellis section from 
time-hj to time-hj+i can be constructed by the following procedure: 

(1) Form and label the states in E^. +l (C) based on 

(2) For each state in Ejij(C'), determine its transitions to the states in 
E/i i+l (C) based on the state transition rules described above. 

(3) For two adjacent states, ^(flyfl^) and ^(a/^ , o^), at time-hj and 
time-hy+x, connect them by parallel branches given by (6.19). 

Repeat the above procedure until the L-section trellis T(U) is completed. 

Example 6.2 Consider the (8,4,4) RM code with the TOGM G as 


1 1 1 1 0 0 0 0 ' 
0 10 110 10 
0 0 11110 0 
0 0 0 0 1 1 1 1 


Suppose we want to construct a 4-section trellis for this code with boundary 
locations in U = {0,2, 4, 6, 8}. First, we find from G that the state space 
dimension profile is (0,2, 2, 2,0). Therefore, P 4 ,max(C) = 2. We also find that 


c oa = = C£ e - C& = {0}, 


PoA g o,2) = 


1 1 
0 1 


P^A G 2,a) = [ 1 1 ] > 


P4,«(G(’e) = [ 1 1 ] , *,.(<?&) = 0 - 


The state defining information sets at the boundary locations, 0, 2,4,6, and 8, 
are given in Table 6.1. Following the constructing procedure given above, we 
obtain the 4-section trellis diagram for the (8,4,4) RM code as shown in Fig- 
ure 6.4, where the states are labeled based on the information defining sets 
using p 4>raax (C)-tuples with P 4 , max (C) = 2. 

AA 
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Table 6.1. State defining information sets for a 4-section trellis for the (8, 4, 4) RM code. 



Time — y 

0 2 4 6 8 

A h 

0 {01,02} {a 2 ,a 3 } {a 2 ,0|} 0 


0 {ai} {03} {02,04} 0 

A h'i.hi*i 

{oi,o 2 } {03} {04} 0 0 


State label (<*1,02) (^2,^3) (<*2*04) 



Figure 6.4. A minimal 4-section trellis diagram for the (8, 4, 4) RM code with 2-bit state 
labels. 

Construction of T{JJ) can be achieved by using the state labeling based on 
a parity-check matrix H for C [101]. Let 

Hhj y hj+i ii * • • » (6.25) 

denote the submatrix of the parity-check matrix H of C that consists of columns 
from to Let l{&hj) be the label for the state <Jh r Then the 

composite branch given by (6.19) connects the state a\ Xy to the state <7A >+1 G 
£fc i41 (C) at time-hj+i that is labeled by 

+ (ph„ft J+1 («) + *>) • (6.26) 
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Eq.(6.26) gives the connection from a starting state <Th J at tim e-hj to a desti- 
nation state <Thj+i at time-/ij+i . 

To facilitate the construction of the (j + l)-th trellis section between time- 
hj and time-hj+i, we form a table, denoted at the completion of the 
construction of the j-th section. Each entry in Qh $ is a triplet, 

The first component / is a binary p^ -tuple formed by a specific combination 
of the phj information bits in A J . This p/^ -tuple defines a specific state a in 
S/ l> (C'). The second component l(<r) is simply the label of state The third 
component is given by 


C = Phi,fc> + 1(«) 

= (6.27) 

where u = / • G a h , is given by (6.18). 

Construction of the ( j + l)-th section of T(U) is carried out as follows: 

(1) Form C£ >h . +l and B hj , hj+l . 

(2) For every entry (/,/(<t),c) € Qh, and every 6 £ form the 

composite branch, 

+ (6.28) 

(3) For every starting state a € E/^C), and b £ the destination 

state a' E E^ J+1 (C) is labeled with 

l(cT') = l(<T) + (b+c).HZ' hi+l . (6.29) 


Repeat the above process until the L-section trellis T(U) is completed. 

The trellis construction procedures presented in Section 4.1, Section 4.2, and 
this section only provide the general steps of construction. A detail and efficient 
trellis construction procedure is given in Appendix A. 
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6.4 PARALLEL STRUCTURE 

Consider the trellis section from time-fy to time-h J+1 . For a given pi ljl h >+ r 
tuple a^, define the following two sets of states at time-hj and time-hy+i, 
respectively: 

S L (a hi ) * (6.30) 

and 

SR{a hj ) ± {a(a h> ,a;.) : a m h} € {0, } . (6.31) 

Then Sz,(a/^) and 5/e(a^) are subspaces of the state spaces, E^(C) and 
E/, i+l (C), respectively. From the state transition analysis given in the previous 
section, we observe the following: 

(1) Every state in Si(ahj) is adjacent to all the 2 P *>+ 1 ~ Ph i *->+ l states in 
•S/^a^) and is not adjacent to any other state in E/j >+1 (C). 

(2) Every state in 5/*(a/ l> ) is adjacent from all the 2 Ph j~ Ph j' h j+i states in 
S L (a hj ) and is not adjacent from any other state in E/i^C). 

Therefore, the states in S^a/^), the states in S/^a/^), and the composite 
branches connecting them form a completely connected subtrellis (known 
as a complete bipartite graph). Since there are 2 Ph >*>+ 1 possible 
tuple a^, there are 2 Ph j > 'j+ l such completely connected subtrellises in the 
trellis section time-h^ and time-hj+i. All these subtrellises are structurally 
identical (isomorphic), and there are no cross connections between them. 
These subtrellises are called parallel components. The parallel structure of 
a trellis section is shown in Figure 6.5. 

It follows from the definition of ph } ,h J+l given by (6.21) and the partition of 
the TOGM G shown in Figure 6.3 that 

ph„h j+l = K - k(C h] ,N) - H c o,hj+i) + k i c h„h j + l )• (6.32) 

Therefore, the total number of parallel components in the trellis section from 
time-hj to time-h ;+ i is 

2 A--fc(C fci , N )-fc(Co. s+1 )+k(C fcj .i. J+1 ) (6.33) 
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2^ fc i’S+i 

parallel 

components 




Figure 6.5. Parallel structure in a trellis section. 

From (6.2) and (6.33), we find that the numbers of states in S^a^) and 
are: 

2*(Co> i+ ,)-*(Co.» j )-fc(C fci , ki+1 ) (6.34) 

and 

(6 .35) 

respectively. Equations (6.30) to (6.35) completely characterize the parallel 
components in the (j + l)-th trellis section of T(U). 

Consider the 4-section trellis diagram for the (8,4) RM code shown in Fig- 
ure 6.4. There are two parallel components in both the second and third sections 
of the trellis. Each component has two states at each end. 

Analysis of the parallel structure of a sectionalized code trellis is presented 
in the Appendix A. 
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Figure 6.6. The 4-section minimal trellis diagram T({0, 4, 8, 12, 16}) for RM2,4- 

Example 6.3 Consider the RM2,4 code which is a (16,11) code. The 4- 
section minimal trellis diagram T(U) with section boundary locations in U = 
{0, 4,8, 12, 16} is depicted in Figure 6.6. There are two parallel and structural 
identical components in both the second and third sections of the trellis, and 
each component is a complete bipartite graph. Each component has 4 states at 
each end. Each state at boundary location 8 has 4 composite branches diverg- 
ing from it. For 1 < j < 4, = RM 2,2 ^ 4 (j-i), 4 j = RMo, 2 * 

Therefore, there are 2 parallel branches between any two adjacent states whose 
4-bit label sequences form a coset of RMo,2 in RM 2 , 2 - In both the second and 
third sections of the trellis, each coset in RM 2 , 2 /RMo ,2 appears 4 times as the 
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composite branch label. In fact, the entire trellis consists of two 4-section par- 
allel and structurally identical subtrellises without cross connections between 
them. The maximum state complexity is 8. Therefore it is possible to devise 
two identical trellis-based decoders, say Viterbi decoders, to process the entire 
trellis in parallel. This not only simplifies the decoding complexity but also 
speeds up the decoding process. 

AA 

The parallel components in a trellis section can be partitioned into groups of 
the same size in such a way that [44]: (1) two parallel components in the same 
group are identical up to path labeling, and (2) if there is a common label 
sequence in two parallel components, then they are in the same group. Since all 
the parallel components in a group have the same label set, in a trellis-based 
decoding algorithm, only the metrics of the branches in one of the parallel 
components need to be computed. This results in a reduction of branch metric 
computation. 

Let denote the subcode of C that consists of those codewords 

whose components from the (hj + l)-th bit to the hj+i-th bit positions are 
all zero. Then each group consists of 2 Ah -’ ,fc -' +l * C * identical parallel components 
where [44] 

x h Jt h i+ i(C) = fc(C'^,h J+1 ) - k(po,h J (Co,h J+I ) np 0 ,hj(<?hj,h j+l )) - Jc(C hj+itJV ). 

(6.36) 

Each parallel component can be decomposed into subtrellises with simple 
uniform structure [44] as shown in Figure 6.7. Consider a parallel compo- 
nent, denoted A. Let E^(A) and E^ J+1 (A) denote the state spaces at two ends 
of A. We first partition E/^A) into blocks, called left U-blocks, which satisfy 
the following condition: 

(Bl) If two states and in E/^A) are in the same left U-block, then 
they have the same set of diverging composite branches, i.e., 

{Lfohj > ^hj + i ) • € E^ + l (A)} 

= ^hj+i ) : ^>+1 € Eh >+i (A)} (6.37) 
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A left A U-block pair and A right 

U-block branches between them U-block 


2 Uk j h j+i( c ) 

states 


states 


states 



2*'SS+i (C) 

states 


2 *'*>•'*>+» (C) 
states 


Time 


hy+i 


Figure 6.7. Partition of a parallel component. 
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and otherwise 

:<r h 

> + i € E* i+ .(A)} 

n {£(< T V< T S+ .) : ^Ai+i € Eh >+I (A)} = 0. (6.38) 

We next partition E^ j+ 1 (A) into blocks, called right U-blocks, which satisfy 
the following conditions: 

(B2) If two states a h i+l and in Eh i+l (A) are in the same right U-block, 
then they have the same set of converging composite branches, i.e., 

{L(<r hi ,<r/, j+1 ) : <T hj € S^(A)} = {L(<T hi ,<r' h . +l ) : % € E J»i( A )} 

(6.39) 

and otherwise 

{L{<T hj ,(T hj+l ) : <r hj € 2a,(A)} n {L(<r h} ,a' hj+1 ) : a hj € S\(A)} = 0. 

(6.40) 


Each left U-block (or right U-block) consists of states [44], where 

(C') - fc (Po,h i+ .(^i,h>+i n(C 0 ,h i+I ® C hi ,N))) ~ k(C 0thj ). (6.41) 

A pair of a left U-block and a right U-block is called a U-block pair. It follows 
from the conditions (Bl) and (B2), that each U-block pair (B/i, has the 

following uniform properties: 

(1) For any two states + 1 and 0^ * n ^>+i» 

{L{a h> ,(Th^i) : (Thj 6^,} = { L { (T h J > a h J + l ) '■ a h, € B hj }. (6.42) 

(2) For any two states <7^ and a' h , in 

: <Th j+l € Bh i+1 } = {L(cr' h} ,<T hj+l ) : <r N+l € B hj+l }. 

(6.43) 
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The first property simply says that for a U-block pair the set 

of composite branches from states in the left U-block Bh } converging to any 
state in the right U-block Bh J+l is the same. The second property simply 
says that the set of composite branches diverging from any state in Bh i to 
states in Bh j+ , is the same. Two different U-block pairs have mutually disjoint 
composite branch sets. 
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